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Preface 


A first draft of this book was written in August 2003. It was based on two sets of 
lecture notes by Yuri Bilu. In the following years, the draft was used in student 
seminars in Rome, on the island of Minorca, and in Regensburg. With the benefit of 
these experiences, a new version was written. It is the basis of this book. 

The notes contain a complete proof of Catalan’s conjecture. To read the first 
few chapters requires little more than a basic mathematical background and some 
knowledge of elementary number theory. The other chapters involve Galois theory, 
some more algebraic number theory, and a little bit of commutative algebra [1]. The 
basic facts from the arithmetic of cyclotomic fields are discussed in the text. This 
material can also be found in the textbooks Introduction to Cyclotomic Fields by 
L.C. Washington [50] and Cyclotomic Fields by S. Lang [25]. 

Our exposition is self-contained with one small exception. This regards 
chapter 14. Here we explain an argument of Mihailescu’s that is based on Francisco 
Thaine’s famous theorem. Our proof of Thaine’s theorem involves an application of 
Chebotarév’s density theorem to the Hilbert class field of a cyclotomic field. While 
we do provide a proof of Chebotarév’s theorem, we do not prove the existence and 
the basic properties of the Hilbert class field. A proof would involve a good deal of 
class field theory, and this is not included in these notes. We hope instead that the 
present application motivates the interested reader to study class field theory. This 
theory is exposed in the textbooks Algebraic Number Theory by J.W.S. Cassels and 
A. Frohlich [10] and Algebraic Number Theory by S. Lang [24]. 

I would like to thank Leonardo Cangelmi, Alessandro Conflitti, Jeanine Daems, 
Carlo Gasbarri, Danny Gomez, David Kohel, Hendrik Lenstra, Preda Mihailescu, 
Peter Stevenhagen, Andrea Susa, Michiel Vermeulen, Valerio Talamanca, Michael 
Tse, Filippo Viviani, Larry Washington, and Gabor Wiese for their remarks on ear- 
lier versions of these notes. I especially thank Yuri Bilu, whose 2002 manuscript [3] 
and 2003 Oberwolfach lecture [4] were very useful to me. 


Rome René Schoof 
May 2007 


Leitfaden 


If you know algebraic number theory and the theory of cyclotomic fields and are 
only interested in Mihailescu’s proof of Catalan’s conjecture, read chapters 1, 7, 8, 
10, 11, 12, and 14. If you want to see a proof from scratch of Catalan’s conjecture, 
then also read chapters 2, 3, 4, and 6. Chapters 5, 9, 13, 15, and 16 deal with some of 
the more advanced prerequisites. Here we discuss Runge’s method, Stickelberger’s 
theorem, basic properties of semisimple group rings, the Chebotarév density 
theorem, and Thaine’s theorem, respectively. 
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1 
Introduction 


In this book, we present Preda Mihailescu’s beautiful proof of the conjecture 
made by Eugéne Charles Catalan in 1844 in a letter [11] to the editor of Crelle’s 
journal: 


Je vous prie, Monsieur, de vouloir bien énoncer, dans votre recueil, le théoréme suivant, 
que je crois vrai, bien que je n’aie pas encore réussi 4 le démontrer complétement: d’ autres 
seront peut-étre plus heureux: 

Deux nombres entiers consécutifs, autres que 8 et 9 ne peuvent étre des puissances 
exactes; autrement dit: l’équation x” — y? = 1 dans laquelle les inconnues sont entiéres et 
positives, n’admét qu’une seule solution. 


In other words, Catalan proposed the following. 


Conjecture (E. Catalan, 1844) The only two consecutive numbers in the sequence 
of perfect powers of natural numbers 


1, 4, 8, 9, 16, 25, 27, 32, 36, 49, 64, 81, 100, 121, 125, 128, 144, 169,... 
are & and 9. 


When k > 2 is fixed, the kth powers of natural numbers are necessarily far 
apart. However, when one varies k, two powers can be closer to one another than 
one might expect. For instance, we have 27 — 5° = 128 — 125 = 3 and 13° — 37 = 
2197—2187 = 10. Catalan conjectured that the only powers for which the difference 
is as small as 1 are 37 and 23. 

Phrased in yet another way, Catalan conjectured that for exponents p, g > 2, the 
Diophantine equation 


xP —y? =] 


admits no solution in natural numbers other than the one given by x = 3, p = 2 and 
y=2,q=3. 
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2 Catalan’s Conjecture 


Apparently, Catalan himself did not get very far in solving the problem. We read 
this in a note [12] that was published more than forty years after his 1844 letter. 
Here Catalan reports on his early attempts: 


Apres avoir perdu prés d’une année a la recherche d’une démonstration qui fuyait toujours, 
j abandonnerai cette recherche fatigante. 


The Société Belge des Professeurs de Mathématique d’Expression Francaise has 
published a book on the academic and political activities of Eugéne Catalan [20]. It 
contains a reproduction of a painting of Catalan which is at present in the possession 
of the Université de Liége (Fig. 1.1). 

In this book, we mainly concentrate on Preda Mihailescu’s proof of Catalan’s 
famous conjecture. We discuss earlier work only when it is relevant to our 


Fig. 1.1 Eugéne Catalan (1814-1894) 
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Fig. 1.2 Rob Tijdeman. (Reproduced by the kind permission of Rob Tijdeman.) 


presentation of the proof. For an overview of earlier work on the conjecture, see [7, 
14, 15, 35, 38]. We only mention one important result: In 1976, Rob Tijdeman [49] 
(Fig. 1.2) showed that there exist only finitely many pairs of consecutive perfect 
powers. His proof is based on the theory of linear forms in logarithms. Unfortu- 
nately, the bound on the size of the solutions that came out of Tijdeman’s proof is 
astronomical. There remained a large gap between the relatively small exponents 
p,q for which Catalan’s equation had been solved and Tijdeman’s estimates. In the 
successive years, this gap was narrowed considerably by various people [2, 6, 17, 
19, 26, 32, 33, 34, 44, 46]. But the gap remained very large. In 1999, refinements 
of Tijdeman’s estimates were shown to imply Catalan’s conjecture when both expo- 
nents p,q exceed 7.78 - 10'°. On the other hand, elaborate computer calculations 
had proven the conjecture when one of p,q is smaller than 10°. See [34] for more 
information. 

Between 2000 and 2003, Preda Mihailescu (Fig. 1.3) proved three theorems 
concerning Catalan’s conjecture: 

Let p,g be odd primes and suppose that x,y are nonzero integers for which 
PS y= I, 


Theorem I (P. Mihdilescu, 2000) We have 


p’-!| = (mod q?) and q?~! = (mod p’). 
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Fig. 1.3. Preda Mihailescu. (Reproduced by the kind permission of Robert Tichy.) 


Theorem II (P. Mihdilescu, 2002) We have 


p=1(modgq) or gq = I|(mod p). 


Theorem III (P. Mihdilescu, 2003) We have 
p< 4q? and q< Ap’. 

Mihailescu’s proofs of Theorems I, I, and III appear in [35], [36], and [37], 
respectively. We show now that these three theorems together lead to a proof 
of Catalan’s conjecture. This proof does not rely on Tijdeman’s work or on the 
computer calculations mentioned above. 


Main theorem The only solutions of the equation 


xP —y? =] 
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in integers p,q > 2 and nonzero integers x, y are given by p = 2, q = 3 and 
Hs y= 2. 


Proof tis indeed true that (+3) — 2? is equal to 1. To prove that there are no other 
solutions, it suffices to show that there are no other solutions when the exponents p 
and q are prime numbers. See Exercise 1.1. 

The case g = 2 was taken care of by V. Lebesgue [27] in 1850. The case p = 2 
was dealt with by Ko Chao [21] in 1965. See chapters 2 and 3 for detailed proofs of 
these two results. 

Therefore, we may assume that both p and g are odd prime numbers. We suppose 
that x,y are nonzero integers satisfying x? — y? = | and we derive a contradiction 
from this assumption. The problem is symmetric in p and q, as one sees by replacing 
(x, y, p,q) by (—y, —x, q, p) in the equation x? — yf = 1. 

By Theorem II, we have either p = I(mod q) or g = |(mod p). By symmetry, 
we may assume that p = 1(mod q). It follows then from Theorem I and Exercise 1.2 
that one even has p = 1(mod gq”). Therefore, p = 1 + kq? for some integer k > 1. 
Theorem III implies then that k < 3. Since both 1 + q* and 1 + 3q? are even and 
hence cannot be prime, we must have k = 2. Since p = 2q? + 1 is divisible by 
3 when g 4 3, we must have gq = 3. This implies p = 19. But this is impossible 
because we have 3!8 4 1(mod 197), contradicting Theorem I. 

This proves the main theorem. 


Theorem I is Mihailescu’s so-called double Wieferich criterion [35]. Proved in 
2000, it shows that if exceptions to Catalan’s conjecture exist, they are very rare 
indeed. But double Wieferich pairs do exist: p = 83, g = 4871 is an example. 
See Exercise 1.4 for other examples. Mihailescu’s main result [36] is Theorem II. In 
combination with certain estimates from the theory of linear forms in logarithms and 
a computer calculation, it led in 2002 to a complete proof of Catalan’s conjecture. 
To reach this point, Yuri Bilu’s (Fig. 1.4) efforts to understand Mihailescu’s original 
proof have been of great importance [3]. The proof that we present here avoids both 
the theory of linear forms in logarithms and the computer calculation. It replaces 
these by Mihailescu’s Theorem III, the proof of which involves only the arithmetic 
of cyclotomic fields. 

The proofs of Theorems II and III that we give work only when p,q are not very 
small. To take care of small exponents, we use the following result. We present 
Mihailescu’s proof [37] of it in chapter 8. This result is much stronger than what we 
need to make the proofs of Theorems II and III work. We need it only for p or q less 
than or equal to 5. 


Theorem IV_ Let p,q be odd primes. Suppose that we have p < 41 org < 41. 
Then Catalan’s equation x”? — y4 = | admits no nonzero solutions x, y € @. 


There are two main ingredients in the proofs of Theorems J-IV. The first one 
is Runge’s method [40]. This method can be said to have already played a role 
in the proofs of certain earlier results regarding Catalan’s conjecture. The second 
ingredient is the theory of cyclotomic fields. In particular, Stickelberger’s theorem 
and Thaine’s theorem play central roles in the proofs. As we already pointed out, 


6 Catalan’s Conjecture 


Fig. 1.4 Yuri Bilu. (Reproduced by the kind permission of Francine Delmer.) 


Mihailescu’s proof does not make any use of Tijdeman’s result [49] and does not 
rely on any computer calculation. 

The first few chapters of this book regard relevant earlier work. In chapter 2 
and 3, we discuss Catalan’s equation x? — y? = | when g = 2 and p = 2, respec- 
tively. These results are very classical [21, 27]. Chapter 4 is devoted to the Diophan- 
tine equation x? — y* = 1, which Euler solved in 1738. We solve it following W. 
McCallum [31], who dealt with the problem as a student participating in a 1977 
honors project at the University of New South Wales. We are then reduced to the 
case where both p and g are odd primes. In chapter 5, we discuss C. Runge’s method 
to effectively bound the integral solutions of certain Diophantine equations. 

In chapter 6, we use this method to prove J.W.S. Cassels’ 1960 result [8], which 
says that when x,y are nonzero integers satisfying x? — y? = 1, then gq divides x 
and p divides y. We show that Cassels’ theorem easily implies that any nonzero 
solution x, y € Z to Catalan’s equation is necessarily very large with respect to the 
exponents p and g. 
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The remaining chapters regard Mihailescu’s proofs and involve the theory of 
cyclotomic fields. Our presentation of Mihailescu’s work is as follows [43]. To a 
nonzero solution of Catalan’s equation x” — y4 = 1 we associate the element x — ¢,, 
of a certain obstruction group H. Here ¢, denotes a primitive pth root of unity and 
the group H is defined as 


H= {ox € Q(¢,)* : ord.(a) = O(mod q) for all prime ideals t 4 p} /Q(op)*4, 


where p denotes the unique prime ideal of the ring Z[¢,,] lying over p. In chapter 7, 
we explain that the group 4H is finite and that there is a natural exact sequence of 
F,[G]-modules 


O: 2. Spf EP. Se OE “Ee higd  20 


where G is the Galois group of Q(¢,) over Q and E, and Cl, denote the group 
of p-units and ideal class group, respectively, of the cyclotomic field Q(¢,). See 
chapter 7 for more details. 

On the one hand, the fact that x — ¢, comes from a solution to Catalan’s equation 
is shown to imply that the F,[G]-submodule of H generated by x — ¢, is large 
in various senses. For instance, Cassels’ result mentioned earlier can be viewed as 
saying that x —¢, is not contained in a certain index-g submodule of H. Theorem 8.3, 
Proposition 11.3, and Theorem 12.4 each state that the F,[G]-submodule of H 
generated by x — ¢, is large in a certain sense. 

On the other hand, the general theory of cyclotomic fields implies that the F,[G]- 
module generated by x — ¢, is also small from various points of view. Indeed, 
Stickelberger’s classical theorem and Thaine’s 1988 theorem provide elements in 
the group ring Z[G] that annihilate the obstruction group H or at least certain parts 
of it. For instance, Mihdilescu’s “double Wieferich criterion” is proved using Stick- 
elberger’s theorem. His result, which is our Theorem I, can be viewed as saying 
that x — ¢, is contained in a proper submodule of the obstruction group H. More 
precisely, x — ¢, is contained in the “Selmer group” S defined by 


S = {a € H : a@ isa q—adic gth power for each prime q lying over q}. 


Corollary 10.3, Proposition 11.2, and Theorem 14.1 each state that the F,[G] 
module generated by x — ¢, is small in a certain sense. 

Confronting both type of statements leads to contradictions. The conclusion is 
then that Catalan’s equation x? — y4 = 1 does not admit any nonzero solutions 
other than the solution (+3)? — 23 = 1. 

In chapter 8, we prove Theorem IV, while in chapter 9, we discuss some of 
the basic properties of the Stickelberger ideal. These are then used in chapters 10 
and 11, where we prove Theorems I and III, respectively. The remaining chapters 
are devoted to the proof of Theorem II. The relevant Runge argument is given in 
chapter 12. In chapter 13, we discuss some elementary properties of semisimple 
group rings. In chapter 14, we exploit these in the proof of Theorem II. The proof 
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involves Thaine’s theorem, which we prove in chapter 16. Our proof of Thaine’s 
theorem makes use of Chebotarév’s density theorem, which we prove in chapter 15. 


Exercises 


1.1 Show that it suffices to prove Catalan’s conjecture for prime exponents: If the 
only solution to x” — y? = | in nonzero integers x,y and prime numbers p,q is 
the one given by (+3)? — 2? = 1, then Catalan’s conjecture is true. 

1.2 Let q be prime and suppose x € Z satisfies x = 1(mod q). Show that x7~! = 
1(mod q?) implies that x = 1(mod q?). 

1.3 Check that 3'8 4 1(mod 197). 

1.4 Show that the pairs of primes (p, g) = (2, 1093) and (911, 318917) are double 
Wieferich pairs, i.e., they satisfy the congruences of Theorem I. 

1.5 A proper power is a natural number of the form n* for some natural number n 
and some exponent k € Zs. 


(a) Show that every m ¥ 2(mod 4) is the difference of two proper powers. 
(b) (Research problem) Investigate whether m = 2, 6, 10, 14, 18, ... are differ- 
ences of proper powers. 


2 
The Case ‘‘q = 2” 


In this chapter, we deal with the case where the exponent g in Catalan’s equation 
xP — y? = | is equal to 2. The proof is by a 2-adic argument [27]. It exploits the 
arithmetic of the ring of Gaussian integers Z[i]. See Exercise 2.2. 


Proposition 2.1. (V.A. Lebesgue, 1850) For any exponent p > 2, the Diophantine 
equation 


xP=y4) 


has no solution in nonzero integers x, y. 


Proof Suppose x, y € Z satisfy x? = y? +1. If p is even, we have (x?/? — y) 
(x?/? + y) = 1 and hence x?/? — y = x?/* + y = +1. Subtracting the equations, 
we find that y = 0. Since y 4 0, we may therefore assume that p is odd. 

Considering the equation modulo 4, we see that y is even and x is odd. See 
Exercise 2.1. It follows that 1 + iy and | — iy are coprime in the ring Z[i]. By 
Exercise 2.2, the ring Z[i] is a unique factorization domain with precisely four units. 
Since p is odd, it follows that the units of Z[i] are pth powers. Since (1 +iy)(1 —iy) 
is a pth power, Exercise 2.3 then implies that 


l+iy=c?, for some c € Z[i]. 
Adding the complex conjugate equation, it follows that 


2= cP +0? = (c+ OCP cP CEH. EPS)” 


Therefore, the integer c+ divides 2. Since c+ is even, it is equal to +2. It follows 
that c = +(1 + bi) for some b € Z. Moreover, since y is even, Exercise 2.2(d) 
implies that c is not divisible by 1 + i and that b is also even. We get 


(14 bi)? + (1 — bi)? = +42. 
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Reducing this equality modulo the ideal 8Z[i], we see that we have the plus sign 


on the right. Using the binomial expansion, we find the following relation among 
binomial coefficients (?): 


(2) en? + ({ Jon" Ale peat (ee oar ety 


Suppose b ¥ 0. See Exercise 2.5 for the basic properties of p-adic valuations. We 
claim that the 2-adic valuation of the first term is strictly smaller than the 2-adic 
valuation of each of the other terms. In other words, we claim 


ord, ((? ) wir’) > ord, ((! Jon) 


for all even k satisfying 4 < k < p — 1. This follows from the fact that the 2-adic 


valuation of 
P\(P k2 __ (PP 2 2 »\k—2 
(7) or Na =(j- ) ke? 


is strictly positive. Indeed, by Exercise 2.4, the binomial coefficient (G2) is an 
integer and, by Exercise 2.6, we have for all even k > 4, 


“\k—-2 logk 
ord,(2(bi)*“) >k -—1> aa > ords(k) = ordo(k(k — 1)). 
og 


A repeated application of Exercise 2.5 now implies that ord2((4) (bi)*) is equal to the 
2-adic valuation of the entire sum, which is zero. This is only possible when b = 0. 
But then we have c = | and hence y = 0. This proves the proposition. 


Exercises 


2.1 Letx € Z. Show that x? is congruent to 0 or | (mod 4). Show that x? +2 cannot 
be a square modulo 4. 
2.2 Let Z[i] be the subring of C defined by Z[i] = {a + bi: a,b € Z}. 


(a) Show that Z[i] is a principal ideal domain and hence a unique factorization 
domain. 

(b) Show that the unit group Z[i]* is equal to { i}. 

(c) Show that the Z[i]-ideal generated by 1 + i is forme and that its square is 
generated by 2. 

(d) Show that a + bi is divisible by | +7 if and only ifa =b mod 2. 


on 
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2.3 


2.4 


25 


2.6 


Let n > | be an integer. 


(a) Let R be a unique factorization domain. Suppose that a, b are coprime 
elements of R with the property that ab = c” for some nonzero c € R. 
Show that there are r,s € R anda unit u € R* for which a = ur” and 


b= u's". 

(b) Take R = Z and n odd in part (a). Show that a = r” and b = s” for certain 
r,séZ. 

(c) Do the same for R = Z[i]. 

Letn € Z. 


(a) Show that for any n € Z.o and any natural number k, the binomial coeffi- 
cient 


n one nl) ke) 
(i) - k-(k—1)--+1 


is an integer. 
(b) Show that for n > 0, we have )~;_, (") = 2” and hence (7) < 2” for all 
O<k<n. 


Let p be a prime. For any nonzero x ¢€ Z, let ord,(x) denote the p-adic 
valuation of x. In other words, ord,(x) is the number of factors p that occur 
in the prime factorization of x. For x € Q*, we write x = y/z for some 
y,z € Z and set ord,(x) = ord,(y) — ord,(z). It is convenient to set 
ord,(0) = +00. For x, y € Q, show that ord,(xy) = ord,(x) + ord,(y). Show 
that ord,(x + y) > min(ord,(x), ord,(y)), with equality holding when ord, (x) 
and ord,(y) are distinct. 

Let p be a prime and let n be a natural number. Show that ord,(n) < logn/ log p, 
with equality holding when n is a power of p. 


3 
The Case ‘Sp = 2” 


In this chapter, we deal with the case where the exponent p in Catalan’s equation 
x? — y? = 1 is equal to 2. This was first done in 1965 by Ko Chao [21]. The proof 
we present here is due to E.Z. Chein [13]. In chapter 5, we give an alternative proof 
of Lemma 3.2, based on Runge’s method. 


Lemma 3.1 Let q > 3 be an odd integer and suppose that x, y are nonzero 
integers satisfying x> — y? = 1. Then 


(i) replacing x by —x if necessary, we have 
x—-1l= 291!q4, 


x+1= 2b%, 
y = 2ab, 


for coprime integers a,b € Z satisfying gcd(2a, b) = 1; 
(ii) we have y > 247! — 2. 


Proof (i) We have (x — 1)(x + 1) = y’. If x is even, then the factors x + 1 of 
x? — | are coprime and, by Exercise 2.3, both x — 1 and x + 1 are qth powers. 
Since these gth powers differ only by 2, they are equal to +1 and we have x = 0, 
which is not the case. Therefore, x is odd and hence y is even. It follows that 24 
divides (x — 1)(x + 1). Changing the sign of x if necessary, we may assume that 
x = 1 (mod 4). This implies that (x + 1)/2 is odd and hence that 27—! divides x — 1. 
Since gcd(x — 1, x + 1) is equal to 2, it follows that the two factors on the left-hand 


side of the equality 
x—1 oe (2)' 
24-1 i aa) 


are coprime integers. By Exercise 2.3, both (x — 1)/24~! and (x + 1)/2 are qth 
powers: (x — 1)/27-! = a4 and (x + 1)/2 = D4 for certain a, b € Z. Moreover, b is 
odd and we have gcd(a, b) = 1. This proves (i). 
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To prove (ii), we subtract the first two equations of part (i) from one another. 
We find 2b? = 2 (mod 27~') and hence b4 = 1 (mod 27-7). Since q is odd and 
the order of the group (Z/24~?Z)* is a power of 2, this implies b = 1 (mod 24~”). 
Since b # 1, we have therefore that |b| > 27~* — 1. From the fact that y is positive, 
it follows then that y = 2ab > 24~! — 2, as required. 

The inequality of part (ii) is used in chapter 5 and plays no role here. The 
following two lemmas imply Ko Chao’s theorem. 


Lemma 3.2 Let g > 3 be prime and let x,y be nonzero integers satisfying 
x? — y? = 1. Then we have x = 0 (mod q). 


Proof Suppose that we have x* — y? = 1 but x 4 0 (mod q). We have 


—) 2: 
+ 1) { —)=x~’. 
(y (= 


By Exercise 3.2, the gcd of the factors y + 1 and (y? + 1)/(y + 1) divides g. Since 
x 4 0 (mod q), the gcd is therefore equal to 1. Exercise 2.3 implies then that each of 
the factors is a square times +1. Since y4 + | is a square, we have y > 0. Therefore, 
both y + | and (y? + 1)/(@ + 1) are positive. It follows that 


y+l=wv for some nonzero u € Z. 


Therefore, both (X,Y) = (x, y4~!/?) and (X,Y) = (u, 1) are solutions to the 
equation 


X?-y-Y=1. 


Since y = u? — 1 is not zero, it must be positive. By Exercise 3.4, the number y 
is not a square, and so we are dealing with a Pell equation. The ring Z[,/y ] is a 
subring of the ring of integers of a real quadratic number field and x + y@~)/? /y 
is a unit of Z[./y ]. By Exercise 3.4, the unit group of Z[/y ] is generated by —1 
and u + ,/y. It follows that 


x+yO-DP fy = +(ut fy)” for a certain m € Z. 


Since (u + iD me = —(—u + ,/y), we may change the sign of u and assume that 
m > 0. Reducing the equation modulo the ideal yZ[/y ], we find 


+(u" + mu™—'./y) (mod yZL/Y ]). 


Se 
Il 
| 


Since the elements 1, ./y form a Z-basis for the additive group of Z[/y], 
it follows that mu”! = 0 (mod y). This implies that y divides mu’~!. Since 
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y +1 = wu’, Exercise 2.1 implies that y is even and u is odd. It follows that m 
is even. Therefore, we have 


x+yC bP? fy = +(u? + y + Quy)”, 


Reducing this equation modulo the ideal uZ[,/y ], we find 


x+y? fy = ty" (mod uZL/y)). 
Inspection of the ,/y-coefficients shows that u divides y4-D/?, Since y+ 1 = uv’, 
we have gcd(u, y) = 1. It follows that vu = +1 and hence y = 0. 
This contradiction shows that x = 0 (mod q), as required. 


Lemma 3.3 Let gq > 3 be prime and let x,y be nonzero integers satisfying 
x?—y1=1. Then we have x = +3 (mod q). 


Proof In view of the previous lemma, we may assume q > 5. Changing the sign 
of x if necessary, Lemma 3.1 implies x — 1 = 27~!a4 and x + 1 = 254 for certain 
a,b € Zwith gcd(2a, b) = 1. It follows that we have 


2 2 
b4 — (Qa) = (+) GS (>) 


Factoring the left-hand side, we find 


‘ b*4 — (2a)! x= 3\" 

o? 2 (FHS) = (S). 
Since 2a and b? are coprime, Exercise 3.2 implies that the gcd of the two factors 
divides g. We claim that the gcd is actually equal to qg. Indeed, if it is 1, Exercise 2.3 
implies that both factors are equal to a square times +1. Since b74 — (2a)? is a 
square, we have b24 > (2a)? and hence b? > 2a. It follows that we have the plus 
sign and that b* — 2a is a square. Since y is not zero, neither is a. This means that 
b* — 2a = c? for some square c’ that is different from b*. The squares nearest to b* 
are (b + 1)”. Since |(b + 1)* — b?| > 2|b| — 1, we have |c? — b?| = 2|a| > 2|b| — 1 
and hence |a| > |b|. However, from the inequalities g > 5 and |x| > 2, it follows 
that 


|x — 1 |x — 1| |x + 1| 
< < 


lal? = < 
24-1 16 ») 


= |d\*, 


showing that |a| < |b]. 
This contradiction shows that the gcd of b* — 2a and the cofactor (b*4 — 
(2a)1)/(b* — 2a) cannot be 1 and therefore must be equal to g. As a consequence, 


eS) is divisible by g so that x =3 mod gq, as required. 
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Corollary 3.4 (Ko Chao, 1965) Let q => 5 be prime. Then the Diophantine equa- 
tion 


x --yl=1 


has no nonzero solutions x, y € Z. 


Proof Suppose that x,y € Z is a nonzero solution. By Lemmas 3.2 and 3.3, 
we have x = 0 (mod q) as well as x = +3 (mod q). Since q ¥ 3, this is clearly 
impossible. Therefore, there are no solutions and the result follows. 


Exercises 


3.1 Let n be an odd integer. Show that = ” is positive for any two distinct real 
numbers x, y. 

3.2 Let g be a prime number and let x, y be distinct integers with gcd(x, y) = 1. 
Show that 


q _ v4 
goa (“= x-y) divides q. 


3.3 Let n be a natural number and let x, y be distinct integers that have the same 
sign. Suppose that |x| < |y|. Show that |y” — x”| > n|x|"~!. Show that the 
inequality is strict when n > 2. 

3.4 Letu € Z.; and let y = uz —1. 


(a) Show that y is not a square. 

(b) Show that —1 and e = u + ,/y generate the unit group of the ring Z[,/y]. 
[Hint: Let j : Z[,/y] <> R denote the embedding that maps ,/y to the 
positive square root of y in R. Let 7 be an arbitrary unit of Z[/y]. By 
multiplying by +e* for some k € Z, we may assume that 1 < j(n) < j(e). 
We claim that 7 = 1. To see this, suppose that 7 = a + b,/y for two 
integers a,b > 0. If n 4 1, we have b > 1. Since the norm of 77 is equal 
to +1, this implies that a7 = +1+b*y > y —1 = wu? — 2. Therefore, 
a > J/u2—2 > u —1. It follows that a > u and hence j(n) > j(e), a 
contradiction. ] 


4 
The Nontrivial Solution 


In this chapter we deal with the exponents p = 2 and g = 3. These were not discussed 
in Chapter 3. In this case, Catalan’s equation x* = y? + 1 describes an elliptic curve 
E defined over Q. Standard “descent” methods [52, App. IV, p. 140] show that the 
group E(Q) of rational points is finite and of order 6. Indeed, E(Q) consists of the 
points (x, y) = (0, —1), (41, 0), (43, 2), and the point at infinity. It follows that 
the only integral solutions of the equation x7 — y? = 1 with x, y 4 0 are given by 
(x, y) = (+3, 2), as required. Euler’s proof [16], presented in the Appendix, can be 
interpreted as such a proof by descent. See [52, XVI]. 

In this chapter, we present a different solution. It is due to W. McCallum [31]. 
He reduces the problem to a Thue equation, which he solves by means of Skolem’s 
method. See [9, s. 10.10] for this method. 

McCallum’s method involves arithmetic in the ring ZLV/2]. We introduce some 
terminology. For every element x of the number field Q(./2), there are unique 
a,b,c € Q for which we have x = a+ b/2 + cv/4. We call the coefficient c 
the V/4-coefficient of x. 


Proposition 4.1 Let n be the unit /2 — | of the ring Z[V2]. The only exponents 
n € Z for which the /4-coefficient of n” is zero are n = 0 and 1. 


Proof The multiplicative inverse of 7 is easily checked to be 1+./2+¥/4. For every 
m > 0, the coefficients of (1 4/34 V4) with respect to the basis {1, /2, /4} 


are all positive. This follows easily by induction. Therefore, the \/4-coefficient of 
n” is never zero when 7 is negative. 

From now on we suppose that n > 0. Set 7 = 1 + 2. This number generates 
the unique prime ideal lying over 3 in the ring Z[\/2]. See Exercise 4.3. The proof 
is by a z-adic argument. For every integer k > 0, there are unique ax, by, cx € Q 
such that m* = a, + byJ/2 + ce 4. The coefficients az, bg, cx are actually in Z. For 
k < 6, they are listed in Table 4.1. 


3 
Since 2° is equal to (1 + v2) = 3 + 3/2 + 34/4, it is contained in the 


ideal 3Z[\/2]. Therefore, for every k > 0, the powers z* are contained in the 


R. Schoof, Catalan’s Conjecture, DOI: 10.1007/978-1-84800-185-5_4, 17 
© Springer-Verlag London Limited 2008 


18 Catalan’s Conjecture 


Table 4.1 Coefficients a,, by, and cx 


k 0 1 2 3 4 =) 6 

ak 1 1 1 ) 9 21 45 
by 0 1 2 3 6 15 36 
Ck 0 0 1 3 6 12 27 


ideal 3031Z[./2]. This implies that for every k > 0, the coefficients az, bg, and cx 
are divisible by 3L3]. Here [£] denotes the integral part of K 
It is convenient to consider the number =F = | — 3 rather than the unit 77 itself. 


We have 


Suppose that for some n > 2, the s/4-coefficient of n” vanishes. Then the same is 


true for the ¥/4-coefficient of (-4)". The \/4-coefficient of (—3)" is equal to the 
sum of the series of the \/4-coefficients of the terms of the sum above. Since the 
1 


./4-coefficients of the terms are equal to ) (- 5) Cr, we have 


BOC) 


Since cp = c; = 0, the first two terms of the series vanish. By assumption, we have 
n # 0 or I, so that we can divide by n(n — 1). Since we have 


1 n\ _ 1 n—2 
aal;) > aot) 


we find 


2. 1 fm =2 LF 
tye ac) (-3) cg 


By Exercise 2.4, the binomial coefficients are all integers. The coefficients cz are 
divisible by 315]. Exercise 4.1 implies that for k > 2, the 3-adic valuation of k(k— 1) 
is at most [k/3]. It follows that all terms are rational numbers with denominators 
that are prime to 3. The inequality ord3(k(k — 1)) < [k/3] is strict when k > 5. It 
follows that the terms with k > 5 are congruent to 0 (mod 3). As one can see in 
Table 4.1, the coefficients c2, c3, and c4 are equal to 1, 3, and 6, respectively. 
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Therefore, evaluating the sum modulo 3 and keeping in mind that 5 = 2 (mod 3), 
we find 


fll iy. 4 5 ae 1 n—2)(n-3) 1\". as 
=;( ) +35 i( 5) v3 2 (-3) ee) 


=—1-—(n+1)+ (n+ In (mod 3) 
=n" + 1 (mod 3). 


This is impossible, and the proposition follows. 


Proposition 4.2 The only nonzero solutions x, y € Z to the Diophantine equation 


xy a1 


are given by (x, y) = (43, 2). In particular, the only nontrivial solutions x, y € Z to 
Catalan’s equation with exponents p = 2 and q = 3 are given by (x, y) = (43, 2). 


Proof Let x, y € Zbe a nonzero solution to the equation x* — y? = 1. We have 
@-De+)=y’, 


If x is even, the factors on the left are coprime and Exercise 2.3 implies that x — 1 = 
r> and x +1 = s° for certain u,v € Z. It follows that s> — r? = 2. Exercise 3.3 
shows that s and r cannot have the same sign. It follows easily that s = —r = 1. But 
then we have x = 0, which is not the case. Therefore, x is odd. Changing the sign 
of x if necessary, we may assume that x = | (mod 4). We have 


x-Ix+1 ey 
i eee 


The two factors on the left are coprime integers. Therefore, Exercise 2.3 implies 
that 


x—1 3 
=u” and 
4 


This shows that 


x+1 . 
— v, for certain u,v € Z. 


v—-2w=1. 


In other words, the number ¢ = v — uV2 is a unit of the ring Z[V2]. Exercise 4.4 
implies that 


é= tn”, 


where 1 = /2—1. To see that we have the plus sign, we take norms. By Exercise 4.2, 
the norm from Q(/2) to Q of ¢ = v — uv2 is equal to v> — 2u? = 1. On the other 
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hand, the norm of 7 is 1. Therefore, we have ¢ = +n”. Since the \/4-coefficient of 
€ vanishes, Proposition 4.1 implies that n = 0 or n = 1. In other words, we have 


v—uv2=1 of vow? =72-1. 


In the first case, we have u = 0, v = | and hence y = 0, which is not the case. In 
the second case, we have u = v = —1 and hence x = —3, y = 2. 
This proves the proposition. 


We see, incidentally, that the nonzero solution (x,y) = (43,2) satisfies 
x = 0 (mod 3), as predicted by Lemmas 3.2 and 3.3 of the previous chapter. 
However, these lemmas were never used in the proof of Proposition 4.2. 


Exercises 


4.1 Compute ord3(k(k — 1)) for 2 < k < 10. Show that the 3-adic valuation of 
k(k — 1) is at most [5] for k > 2. Show that the inequality is strict when k > 5. 
4.2 Let f € QLX] be an irreducible monic polynomial and let a € Q denote a zero 
of f. Put F = Q(a). Show that for every t € Q, the norm (from F to Q) of 
t — a is equal to f(t). 
4.3 (a) Show that the ideal generated by 2 in the ring Z[V2| is the cube of the ideal 
generated by 2. Compute the residue field of the prime V2. 
(b) Show that the ideal generated by 3 is the cube of the ideal generated by 
nm = 1+ /2. Compute the residue field of the prime z. 


4.4 The goal of this exercise is to show that the unit group of the ring Z[\/2] is 
generated by —1 and n = 2 — 1. Consider the injective ring homomorphism 
Q(/2) > R x C determined by 


V2 v> (V2, e142). 
We denote it by x + (x’, x”). For x € Q(./2), we define its “length” ||x|| by 
Ie? = [xP + ke" 
(a) Leta, b,c € Qand putx =a+ by/2 + cv/4. Show that 
Il)? = 3(a? + V4 + 20°02). 


/ 


(b) Show that 7’ is a real number satisfying ‘ <7’ < 1. Let wu be an arbitrary 


unit of ZLV/2]. Show that for some integer k, either the unit ¢ = +u n* or 
e = tu—'nf satisfies  < e' < 1. 
(c) Let ¢ be a unit of Z[/2] satisfying 5 < e’ < 1. Show that |e”|? < 2 and 


hence |le||? < 5. Use part (a) to show that e = +1. 


5 
Runge’s Method 


In this chapter, we describe C. Runge’s method to determine the integer solutions 
of certain Diophantine equations (Fig. 5.1). Runge described his method in a 1887 
paper in Crelle’s journal [40]. In this section, we explain it along the lines of a short 
unpublished note by Yuri Bilu. This involves somewhat more mathematics than is 
required for the rest of the notes. The method is applied to prove theorems of Cassels 
and Mihailescu that regard Catalan’s equation. 

Our explanation of Runge’s method involves the theory of algebraic curves. It 
is certainly not indispensable to view Runge’s procedure as the analysis of the 
poles and zeroes of certain functions on algebraic curves, as we now proceed to do. 
Indeed, there is no trace of this point of view in Cassels’ paper [8]. Mihailescu appar- 
ently only heard of Runge’s method months after he proved Theorem 12.4. However, 
several mathematicians—the author included—have the feeling they understand 
better what is happening when they see the estimates and inequalities embedded 
in this kind of mathematical context. In addition, our geometric point of view seems 
to be close to the way Runge [40] himself looked at the matter. 

It is not necessary to understand the material in this section in order to read 
the rest of this book. Many readers may find this section useful, but others may 
prefer to skip it. To accommodate the latter, the proofs in chapters 6 and 12 are 
written without any reference to this chapter. They consist of a series of estimates 
and inequalities and can be read for their own sake. Only at the end of each proof is 
the relationship to the point of view presented in this chapter explained. 

Let C be a smooth, projective, absolutely irreducible algebraic curve over Q. 
The complex points of C form a compact Riemann surface C(C). It has a natural 
topology induced by the usual topology of C. Let Y be a nonconstant element of 
the function field Q(C). In other words, Y is a nonconstant function on C and is 
defined over Q. A point P in the set of Q-rational points C(Q) is called integral 
if the value Y(P) of the function Y in the point P is in Z. This integrality notion 
depends, of course, on the choice of the function Y. Let Q € C(C) bea pole of Y. It 
has coordinates in Q. Since Y is defined over Q, the Gal(Q/ Q)-conjugates of Q are 
also poles of Y. 


Proposition 5.1. Using the notation above, let U C C(C) be an open neighbor- 
hood of Q and its Galois conjugates. Then there are only finitely many integral 
points in C(Q) that are not contained in U. 
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Fig. 5.1 Carl Runge (1856-1927) 


Proof By the Riemann—Roch theorem, there exists a nonconstant function in Qc) 
that has no poles outside Q. Multiplying this function by its Gal(Q/Q)-conjugates, 
we obtain a nonconstant function Z in the function field Q(C) that has no poles 
outside the set Q = Qj, Qo,..., QO; of Galois conjugates of Q. The function field 
Q(C) has finite degree over the field Q(Y). By Exercise 5.3, the function Z is integral 
over the ring Q[Y]. By multiplying Z by a suitable nonzero integer, it becomes even 
integral over Z[Y]. As a consequence, for any integral point P € C(Q), the rational 
number Z (P) is integral over Z. Gauss’s lemma implies then that Z (P) is an ordinary 
integer. 

Now, since the points Q;, Qo,..., Q, are the only poles of the function Z, the 
continuous function P +> |Z(P)| from the set C(C) — {Q;, Qo,..., Q;} to R is 
bounded on the compact set C(C) — U. Therefore, there are only finitely many 
possibilities for the integer Z (P). Since the function Z is not constant, there are only 
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finitely many points P € C(Q) for which Z(P) has a given value. It follows that 
there are only finitely many integral points on C that are not in U, as required. 


In certain situations, one can in this way actually prove that the set of all integral 
points P € C(Q) is finite. This happens, for instance, if the pole divisor of the func- 
tion Y has at least two Gal(Q/ Q)-orbits. Indeed, if Q and R are two nonconjugate 
poles of Y, we can find open neighborhoods U of Q and its conjugates and V of R 
and its conjugates with the property that UM V = J. Then we have 


C(Q) c (C(C) — U) U(C(E) — V), 


and an application of Proposition 5.1 to the open sets U and V shows that the set of 
integral points in C(Q) is finite. 

Another favorable situation occurs when all conjugates of the pole Q of Y are 
complex rather than real. In this case, there exists an open neighborhood U of Q 
and its conjugates with the property that U M C(R) = J. It follows that all rational 
points P € C(Q) are not contained in U. Therefore, |Z(P)| can be bounded for all 
integral points P € C(Q). 

The existence of the function Z is guaranteed by the Riemann—Roch theorem, but 
in practice one explicitly writes down a function Z that visibly has no poles outside 
the pole divisor of Y. This is done by showing that |Z(P)| remains bounded as P 
approaches any of the poles of Y that are not conjugate to Q. The very existence 
of these bounds implies then that the function Z has no poles outside Q and its 
conjugates. 

If the bound on the absolute values of Z (P) for P ¢ U is explicit, then the 
approach described here leads to an effective procedure to determine all integral 
points P € C(Q) that are not in U. 

Before illustrating the method by means of some explicit examples, we recall a 
basic fact from analysis. 


Lemma 5.2 Let c > O and let F(t) be a function of a real variable that ism + 1 
times differentiable in the interval {t € R: |t| < c}. Let F,,(t) be the sum of the 
terms of degree < m of the Taylor series around 0 of F(t). Then for all t with |t| < c, 
there exists t’ in the closed interval with endpoints 0 and t such that 


pti 
F(t) — F(t) = ———F™ V(’). 
(t) (t) (aD! (t’) 


Here F'"*") denotes the (m + 1)st derivative of the function F. 


Proof This is just the the remainder term of the Taylor series expansion of F(t) 
around 0. See [39, Thm. 5.15]. 


In the first example, we make use of the following estimate. 
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Lemma 5.3 Let F(t) be the function of a real variable t given by F(t) = (1 +t+ 
t*)'/2, Let F,(t) = 1+ St - at? denote the sum of the terms of degree at most 2 of 
the Taylor series of F(t) around 0. Then we have 


IF@) - FO < ral for -1/2<t<1/2. 


Proof The polynomial F(t) is also the sum of the terms of degree at most 2 of the 
Taylor series of the function (1 + 1)'/? around 0. By Lemma 5.2, we have 


t/>3 
14p- Aw <3 
| +1) aH = arg 


Wee, 
for some t’ between 0 and ¢. Since |t’| < oo the right-hand side is at most xB \t|?. 


The difference between F(t) and the function (1 + 1)!/? is equal to 


[r\* 
(+r'/24+ 444+ 04)!/2" 


IF@)—-d4+n'?| = 


Since |t| < 5, this is also at most ai |t|>. Combining the two estimates, we get 


IFO —- Aw) <|FO-d4+9'7?| +10 4+9!? - bol 


1 
|t| ’ 


J2 


3 
¢/" = 


IA 


It}? + 
2 


2/9 2/2 


as required. 


Proposition 5.4 The only integral solutions x, y € Z of the equation 


yexte 4 


are given by (x, y) = (—2, £3), (-1, £1), (0, £1), and (2, +5). 


Proof Consider the algebraic curve given by the equation Y? = X*+ X* +1. The 
associated homogeneous equation Y* Z* = X*+ + X3Z + Z* describes a projective 
irreducible curve in the projective plane P*. This curve has a singularity at infinity. 
An equation that describes a curve with the same function field, but that is smooth 
in a neighborhood of infinity, is given by T7 = 1+ S + S*, where T = Y/X? and 
S = 1/X. The points at infinity are the points for which S = 0. We let Q denote the 
point corresponding to T = 1 and Q’ the point with T = —1. A point (x, y) on the 
curve approaches Q when y — +00, while it approaches Q’ when y > —oo. 

We consider the function Y. By Gauss’s lemma, a rational point P = (x, y) on the 
curve is Y-integral precisely when it is integral in the usual sense: It means x, y € Z. 
The points Q and Q’ are the only poles of the function Y. As a consequence, the 
points Q and Q’ are also the only poles of the function X. 
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The function 


is a polynomial in X and Y. Therefore, its poles can only be Q or Q’. The function 
remains bounded as the point (x, y) approaches the pole Q or, equivalently, when 
|x| and y + +o. Indeed, since 


1-4 
yoxt+x41 = (14245). 
Xx x 


(2)-9G)|2 aa 


whenever |x| > 2. Therefore, the only pole is Q’. Multiplying by 8, we obtain the 
function 


Lemma 5.3 implies that 


1 1 
yout det a = oe 


Z =8Y —8xX?-4X +1. 


It is integral over Z[X] and hence has integral values in integral points of the curve 
Y* = X*+ X? +1. Since the function Z attains odd values in integral points, it does 
not vanish. It follows that we have 


4/2 
(Size iy ake seep. 


This implies that when x, y € Z satisfy y> = x* + x* + 1, we must have |x| < 
4/2 =5.6.... Checking all possible values —5 < x < 5, we find that xt4+x3 $1 
is a square precisely when x = —2, —1, 0, 2. This proves the proposition. 


Non-example 5.5 (The Pell equation) Let d be a natural number. The curve given 
by the equation X* — dY? = 1 has two points, Q and Q’, at infinity. They become 
visible when we consider the associated projective curve C in P* given by the 
homogeneous equation X* — dY* = Z?. Putting T = X/Y, they are given by 
T = +d. The points Q and Q’ are the poles of the functions X and Y. If d is a 
square, Q and Q’ are not conjugate over Q and Runge’s method applies. The sets 
U = {(x, y) € C(R): xy > O} and U’ = {(x, y) € C(R) : xy < O} are disjoint 
open neighborhoods of Q and Q’, respectively. 


Actually, it is easy to see directly that when D is a square, the only integral 
solutions to the equation x* — dy? = 1 are given by x = +1 and y = O. On the 
other hand, if D is not a square, we obtain a Pell equation. In this case, Runge’s 
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method does not imply that there are only finitely many solutions x, y € Z. This is 
explained by the fact that the two poles are conjugate over Q. This is a good thing, 
because it is well known that the Pell equation X* — dY* = 1 admits infinitely many 
solutions x, y € Z. 

In various parts of the proof of Catalan’s conjecture, Runge’s method is applied 
to families of curves. The curves and the bounds one obtains typically depend on 
the exponents p and gq of Catalan’s equation. It is essential that one controls this 
dependence. We illustrate the method in a simple case. 

We make use of the following lemma. Let g > 2 be an integer and let F(t) be the 
function of a real variable t > 0 given by 


14 74\ 12 
F(t) = ( ae ) . 
1+t 
Lemma 5.6 Let F,,(t) denote the sum of the terms of degree at most m of the 
Taylor series around 0 of F(t). Form < q, we have 


|F(t)— F,(t)| < tt! for teR with O<t <1. 


Proof Since we have m < q, the polynomial F,,,(t) is also the sum of the terms of 
degree at most m of the Taylor series expansion around 0 of the function (1 +1)7!/. 
It is convenient to use the latter function when doing the estimates. When 0 < t < 1, 
we have 


IF) — Fa) < | FOO - +097?) 410 +0971? = Fa 


= id é -1/2 gmt 
= d+)! m el + fynt3/2° 


The estimate of the first term is elementary. The second term is estimated by 
applying Lemma 5.2 to the function (1 + t)~!/?. By Exercise 5.6, the absolute value 
of the binomial coefficient is at most 1/2. This leads to 


1 1 
| F(t) — Fin(t)| < ¢"*" (Ge 4. 5) <p", 


as required. 


We give an alternative proof of Lemma 3.2. In other words, we show the 
following. 


Proposition 5.7 Let q > 3 be prime and suppose x,y are nonzero integers satis- 
fring the equation x? — y1 = 1. Then we have x = 0 (mod q). 


Proof Letx, y € Zbea nonzero solution and suppose x ¥ 0 (mod q). We have 


a) 2 
+1 = ye 
(y (Ze 
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By Exercise 3.2, the gcd of the two factors on the left divides g. Since x 4 0 (mod q), 
the gcd is therefore equal to |. By Exercise 2.3, each factor is then itself a square times 
+1. Since y? = x? — 1, the number Y is positive. This implies that both factors y + 1 
and (y? + 1)/(y + 1) are positive, so that we actually have plus signs. Therefore, we 
have for certain integers u, v € Z, 


y+l=w’, 
ase 
y+l , 


The point (y, v) lies on the algebraic curve given by the equation 
yo} yi? 4...-Y¥+1=V". 


The algebraic curve described by this equation is irreducible. The associated homo- 
geneous equation Y?~! — y?-?Z +... — ¥Z4-! + ZI"! = V*Z4-3 describes a 
projective irreducible curve in P”. This curve has a singularity at infinity. An equation 
that describes acurve with the same function field, but that is smooth in a neighborhood 
of infinity, is given by W7 = 1—U+...+ U4—!, where W = V/Y@-Y? U =1/Y. 
This shows that the curve C has two points, Q and Q’, at infinity corresponding to 
W = —land W = 1, respectively. The function on C with respect to which we 
define the integrality is Y. In other words, a rational point (y, v) is integral when 
y € Z. Gauss’s lemma implies that then v is also in Z. The function Y has two zeroes. 
Therefore, it has two poles—the points Q and Q’. 

We want to show that there are no rational points (y, v) on the curve C with 
y > 0. By symmetry, we may assume that v > 0 as well. The subset U = 
{(y, v) € C(R) with vy4—-)/? < 0} U {Q} is an open neighborhood in C(R) of the 
pole Q. All rational points (y, v) of C with y, v > 0 are contained in the complement 
of U. Proposition 5.1 implies that there are at most finitely many integral points (),z) 
outside the open set U. 

We now apply Runge’s method. We exhibit a function Z without any poles 
outside Q. Set m = (g — 1)/2 and consider the function 


1 
Z=V—-Y" Fy (+) ; 
We 


where F,,,(t) is the polynomial that occurs in Lemma 5.6. We have 


1 _1l _l 
Z= Vey" pay AS 2 \y—( 2). 
2 m—1 m 


Since neither function Y nor V has any poles outside Q and Q’, the same is true for Z. 
Next we show that Z(P) tends to zero when P approaches Q’. Let P = (y, v) with 
y, v > 0. Then we have 
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1/2 
ast) " (5) 

v= = F({—-). 
Ces y 


By Lemma 3.1 (ii), we have y > 27~! — 2 > 2. Therefore, Lemma 5.6 implies 


1 1 1 
cry =|o—s*Fa(S)I="(#(5)-™(5)) 
¥. y y. 
esd 
ymtl i y 
It follows that Z(P) tends to zero when y — +00, ie., when P = (y, v) 


approaches Q’. Therefore, the function Z has no pole at Q’. 


The function Z introduced above is integral over Q[Y], but not over the ring Z[Y]. 
This follows from the fact that the coefficients of F,,(t) are not in Z. Indeed, by 


Exercise 5.7, the binomial coefficients (2) have denominators equal to 2+"), 
Therefore, we set D = 2+ and consider the function Z’ = DZ. This is a 
polynomial in Y and V with coefficients in Z. This is the function that occurs in the 
description of Runge’s method given above. It is integral over Z[Y] and therefore 
takes integral values in all integral points P of the curve C. We have the following 
estimate for Z/(P) = DZ(P) when P = (jy, v) is an integral point with y, v >0: 


; D 
|Z (P)| = DIZ(P)| < ce 


We use additional information to complete the proof of the proposition. By 
Exercise 5.5, we have ord)(m!) < m and hence D < 2”+"-) = 24-7, By 
Lemma 3.1 (ii), we have y > 24~! — 2. This implies 


q-2 


2 
|Z'(P)| < ———. 
WD 


which is at most 1 because we have q > 3. Since Z’(P) is an integer, it follows that 
Z'(P) = 0. This implies 


Dv = s or 


k=1 


Since D is even, so is the number on the left. On the other hand, Exercise 5.7 implies 
that for O < k < m, the number D(2) is an even integer, while for k = m, it is 
odd. This is impossible. 

This contradiction shows that there are no integral points P = (y, v) with y ¥ 0. 
Therefore, our assumption is wrong and we must have x = 0 (mod q), as required. 
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Runge’s method extends naturally to curves over algebraic number fields. Let 
F be a number field and let Of denote its ring of integers. Let C be a smooth, 
projective, absolutely irreducible algebraic curve over F and let Y be a function on 
C that is defined over F. An F-rational point P € C(F) is called integral if the value 
of the function Y in the point P is in Or. Let Q be a pole of Y and let Z be a function 
in F(C) that has no poles outside the set Q = Qi,..., Q, of Gal(F'/F)-conjugates 
of Q. By multiplying Z by a suitable nonzero integer, we may assume that it is 
integral over Of[Y]. For each embedding ¢ : F C C, one can effectively bound 
the values of the function |@(Z(P))| in the points P that are outside a given open 
neighborhood U of $(Q}), ..., 6(Q,-) in the Riemann surface C(C). 

In certain situations, one can use this approach and show that the set of a// integral 
points P € C(F) is finite. Suppose, for instance, that the poles of the function Y are, 
up to Gal(F’/F)-conjugacy, given by Q“,..., Q. For eachi = 1,...,r, let Z; 
be a function in F(C) that is integral over the ring F[Y] and has no poles outside 
Q. Then it also has no poles in any of the Gal(F/F')-conjugates of 0. For every 
embedding @ : F ~ C, and eachi = 1,...,r, let U; C C(C) be a neighborhood 
of the point ¢(Q™) and its Gal(F /F’)-conjugates. In addition, suppose that the U; 
are disjoint. Then for any point P in C(F), the point ¢(P) € C(C) is contained 
in at most one set U;. It follows that the absolute values of the complex numbers 
o(Z;(P)) € C can be bounded for each j # 7. 

Therefore, if the number r of poles of Y exceeds the number of complex embed- 
dings of F into C (taken up to complex conjugation), the box principle implies 
that for each point P, at least one function Z; has the property that all complex 
numbers @(Z;(P)) are bounded. When Z;(P) is in Of, this gives us an explicit 
finite list of possible values Z;(P). From this it is easy to determine all integral 
points P € C(F). 

Another favorable situation occurs when F is totally real and Y has a pole Q 
whose conjugates are all complex rather than real. In this case all points P € C(F) 
are far away from Q and its Galois conjugates with respect to every embedding 
o: F — C. The latter situation occurs in Mihailescu’s proof of Theorem 12.4. 


Exercises 


5.1 Let R C S be an extension of rings. Let x € S. Show that the following are 
equivalent: 


(a) the element x is integral over R; 

(b) we have xM C M for some finitely generated R-submodule M of the addi- 
tive group of S; 

(c) the subring R[x] of S is finite over R. 


5.2 Let R be an integrally closed domain with quotient field K and let L be a finite 
extension of K. Let x € L. Then the following are equivalent: 


(a) the element x is integral over R; 
(b) the minimum polynomial of x over K is contained in R[X]; 
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5.3 


5.4 


5.5 


5.6 


5.7 
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(c) the characteristic polynomial of x is contained in R[X]. (The characteristic 
polynomial of x is the characteristic polynomial of the matrix of the K- 
linear map L —~> L that is given by multiplication by x.) 


Let F be a field and let C be a smooth, irreducible, projective curve over F. 
Let Y be a nonconstant element of the function field F(C) and let D be its pole 
divisor. Let g be a function in F(C) that has no poles outside D. Show that g is 
integral over FY]. (Hint: See [1, Cor. 5.22].) 

Let R be a Noetherian integrally closed domain with quotient field K of char- 
acteristic zero. Let L be a field extension of K of finite degree. Show that the 
integral closure of R in L is finite over R. (Hint: See [1, Prop. 5.17].) 

Let n > | be an integer and let P be a prime. For any x € R, we let [x] denote 
the integral part of x. 

(a) Show that ord,(n!) = ical}. 

(b) Show that ord,(n!) = 7S) where s(n) is the number of digits of n written 


p-l 
in base p. Conclude that ord,(n!) < 


n 
p-\- 


For any domain R of characteristic zero and any x € R andk € Zs, we define 
the binomial coefficient (1) in the quotient field of R as follows. We put (7) =1 
when k = 0 and we let 


for k>1. 


x Lae De cere = K = 1)) 
(;)- ee Dea tk, ; 


Let k € Z.o. Show that for x € R, one has 
(a) 


(b) 


() <|x| for -l<x <0. 

(a) Let / be a prime and let x be contained in the ring Z, of /-adic integers. Show 
that G) is contained in Z, for every k € Zo. (Hint: The function x b> G) 
is a continuous map from the field of /-adic numbers Q, to itself. Now use 
Exercise 2.4 and the fact that Z is dense in Z;.) 

(b) Let x € Q*. Show that for every natural number k, the binomial coefficient 
(3) is contained in the ring Z[x]. In other words, show that (7) is J-integral 
for all primes / that do not divide the denominator of x. [Hint: Use part (a).] 

(c) Let x € Q* and suppose that the denominator of x is a prime number gq. 
Show that the denominator of (j) is equal to gto), 
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5.8 Leta € Rand F(t) = (1+ 1t)*. Let m > @ and let t € C with |t| < 1. Using 
the notation of Lemma 5.2, show that 


laa 


a 
|F(t) — Fn(t)| < (,, rf i) Cd — [pete 


5.9 Let F,G € Z[X] be two monic polynomials of degree n > 2. Let C be the 
projective curve given by the homogeneous equation Z” F(X/Z) = Z"G(Y/Z) 
in the projective plane P?. 


(a) Show that the infinite points of the algebraic curve C are the points charac- 
terized by the equation 7” = 1, where T = Y/X. 

(b) Show that the function X — Y in the function field Q(C) has no pole at the 
infinite point given by T = 1. 

(c) Bound the size of the integral points on C in terms of the polynomials F 
and G. 

(d) Find all integral solutions of the equation X? — X +1 = Y°+2Y?. 


6 
Cassels’ theorem 


In this chapter, we prove J.W.S. Cassels’ theorem (Fig. 6.1). See [8]. For the sake of 
reference, we prove the following easy lemma. 


Lemma 6.1 Let p,q > 2 be integers and suppose that x,y are nonzero integers that 
are a solution to the equation x? — y4 = 1. Then p and q are necessarily distinct. 


Proof Indeed, if p were equal to g, then we would have x”? — y? = 1. But pth 
powers are far apart. When the distance between them is 1, necessarily one of x,y 
is zero. See also Exercise 3.3. Therefore, we have p # q, as required. 


When p and q are both odd, there is a symmetry: If (x, y, p, g) is a solution to 
Catalan’s equation, then so is (—y, —x, q, p). Therefore, we may assume p > q. 

Cassels’ theorem is concerned with Catalan’s equation for the odd prime expo- 
nents p and g. We first prove the easy part of his result. 


Proposition 6.2. Let p > q be two odd primes and suppose that x,y are nonzero 
integers for which x? — y? = 1. Then both of the following hold: 


(i) q divides x; 
(ii) |x| >qtqr. 


Proof (i) Suppose that g does not divide x. By Exercise 3.2, the two factors y + 1 
and (y? + 1)/(y + 1) of y? + 1 = x? are coprime. Since their product is a pth power, 
Exercise 2.3 implies that we have 


y+1=bD?, 
Odds sp 
an 


for certain b,u € Z. It follows that x? — (b? — 1)4 = 1. This implies that the 
two pth powers x? and b?% are relatively close to one another. Too close, as we 
show now. As a function of x, the expression x”? — (b? — 1)? is strictly increasing. 
When b > 0, substituting x = b% gives a value exceeding 1, while substituting 
x = b4 — | leads to a negative value. Indeed, we have (b4 — 1)? < (b? — 1)? 
because p > q. On the other hand, when b < 0, substituting x = b’ — | gives a 
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Fig. 6.1 J.W.S. Cassels. (Reproduced from J Have a Photographic Memory by Paul R. Halmos. © 
American Mathematical Society, 1993, with permission from the American Mathematical Society.) 


value exceeding 1, while substituting x = b leads to a negative value. Indeed, we 
have (—b? +1)? < (—b4 +1)? because p > q. See Exercise 6.1 for the inequalities. 
In either case, an integer x for which the expression x”? — (b? — 1)? is equal to 1 
does not exist. 

We conclude that g divides x, as required. 


(ii) Part (i) implies that the gcd of the factors y+1 and (y? + 1)/(y + 1) is equal 
to qg. Exercise 6.2 implies that (y’ + 1)/(y + 1) = q (mod q’). It follows that g? 
does not divide (y? + 1)/( + 1). Since g” divides y? + 1, it must be the case that 
q’' divides y + 1. It follows that the numbers 


yer A y+ 
and ; 
qiy + 1) qr? 


are coprime integers. Since their product is a pth power, Exercise 2.3 tells us then 
that 


y+l=q?'bP 
year 2 
yer’ 
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for certain b,u € Z. Note that uw is positive by Exercise 3.1. Since Exercise 6.2 
implies 

ae. 

y+ 


we have qu? = q (mod gq?!) and hence u? = 1 (mod q?~”). Since p exceeds q, it 
does not divide the order of the group (Z/q?~*Z)*. It follows that u = 1 (mod g?~?). 
We claim that u 4 1. Indeed, if u = 1, we have g = qu? = (y? + 1)/(y+ 1). Since 
x #0 and hence y # —1, Exercise 6.4 implies that g = 3 and y = 2. This means 
that x? = y? + 1 = 9, which is impossible. From the fact that u > 0, it follows that 
u > 1+4q?~?. Since we have x = qub, part (ii) follows. 

The proof of Proposition 6.2 (i) is a very easy instance of Runge’s method. The 
relevant curve is given by X? — (B? — 1)? = 1. The points at infinity are given by 
T? = 1, where T is the function X/B’. When p > gq, the function Z = B’ — X 
is zero in the infinite point Q given by T = |. Therefore, Z has no poles outside the 
infinite point Q’ given by T?~! +... + T + 1 = 0. The estimates that appear in the 
proof show precisely that the values of the function Z on the real points of the curve 
are in the half-open interval [0, 1). This implies that Z vanishes on integral points. It 
easily follows that the only integral points on the curve are given by (X, B) = (0, 0) 
and (1, 1). 

The following lemma is used in the proof of Cassels’ theorem. For odd integers 
Dp > q, let F(t) denote the function of a real variable ¢ given by 


= q (mod(y + 1)), 


F(t) =((+n? -2?)'". 
Recall that [x] denotes the integral part of x € R. 


Lemma 6.3 Let p and q be odd integers satisfying p > q > 3, letm = ll + 1, 
and let F,,(t) denote the sum of the terms of degree at most m of the Taylor series 
around 0 of the function F(t). Then we have 


[rj 
< 71 \#)\2? 
(1 — ||) 


Proof Since we have m < p, the polynomial F,,,(f) is also the sum of the terms of 
degree < m of the Taylor series of the function (1 + t)?/4 around 0. It is convenient 
to do the estimates using the latter function. For t € R satisfying |t| < 1, we have 


|F(t) — Fin(t)| fort € Rwith |t| <1. 


IFO-—Fa@®| < |FQ-d+n?4| +/+ 02/4 — Fa@l. 


We estimate the first term as follows: 


1 ipa 
IF) -—U+0)?/4| < =|t/P|e']@ 
q 


IA 


1 
=|t/P = |p? 
q 


1 sel? 
qq tl)" 
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The first inequality is an application of the mean value theorem to the function 
x t> x!/7, Here ft’ is a real number between (1 + f)? and (1 +1)? —t?. Exercise 6.5 
shows that |t’| => (1—|t|)?, which implies the second inequality. The third inequality 
follows from the fact that p(1 — D 2. 

We estimate the second term as follows: 


(1 +2)?4 — Fa(t)| < |e" 


P 
& J] = toate 
m+1 


1 \ej"! 
m+1(1— ||P" 
The first inequality follows from Lemma 5.2 or Exercise 5.8. By Exercise 5.6(a), 
the binomial coefficient is at most 1/(m + 1). The second inequality follows from 


this and the fact that 2 —m — 1 > —2. 
Combining the inequalities, we find 


IF®)— Fal < (e+) 2 
Gad) Ce 


Since p > m+ 1 and both m and q are at least 2, the right-hand side is at most 
|r/"*!/(1 — |t|)*, as required. This proves the lemma. 


Theorem 6.4 (J.W.S. Cassels, 1960) Let p,q be two odd primes, and suppose that 
x,y are two nonzero integers for which x? — y4 = 1. Then q divides x and p 
divides y. 


Proof By Lemma 6.1, we have p 4 qg. By symmetry, we may assume p > q. The 
fact that the “small” prime q divides x was proved in Proposition 6.2. Now we show 
that the “large” prime p divides y. The proof is an application of Runge’s method. 


The proof is by contradiction. Suppose that p does not divide y. By Exercise 3.2, 
the numbers x — 1 and (x? — 1)/(x — 1) are then coprime and Exercise 2.3 therefore 
implies 

x—-l=a’‘, 
for a certain nonzero a € Z. Substituting this into Catalan’s equation, we obtain 


yl = (a? +1)? - 1. 


Writing F(t) for the function ((1 + 1)? — t? yl/ 4 (for t € R), this means that we have 
y= a? F(4). As in Lemma 6.3, we set m = [7] +1. We set 


= 1 
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Since the exponent mq — p is positive, the number z is equal to a polynomial expres- 
sion in a and y. The coefficients of the polynomial 


1 P P 
t” Fn (;) = fT 4 (i) +...+(4) 
t 1 m 


BR 


are not integral. Indeed, they are the binomial coefficients ( ny) forO < k <m. By 
Exercise 5.7, the denominators of these numbers are equal to g‘*°")), Therefore, 
each denominator divides D = gta", and the number 


m Pp 
Dz = Da™? ?y— a" D{ 4 \a~# 
yam > D(*) 


k=0 


is an integer. In addition, Dz is not zero, because it is not congruent to zero modulo gq. 
This follows from the fact that all terms in the sum except the mth one are integers 
that are divisible by q. Indeed, the mth coefficient is equal to the binomial coefficient 
m-+ord, (m!) 


(:) multiplied by its denominator D = q 
We conclude the proof by estimating the size of Dz. We have 


a (r(8)-#(2) 


Since x 4 0 but is congruent to 0 (mod q), we have a £ 0, +1 and hence |a| > 2. 
Therefore, Lemma 6.3 with tf = 1 /a‘ applies. We have 


la|? 1 1 
[z| < a < —___—_. < a 
~ (jal@—1? ~ jal@-2 ~ |x|-3 
By Proposition 6.2 (ii), we have |x| > q?~! + q and hence |x| —3 > q?"!. 
Therefore, 

D 
= 
|x| —3 

We now come to the critical part of the estimate. It is essential that we show that the 
exponent is negative. By Exercise 5.5(b), we have 


|Dz| m-rordg (m!)—(p—1)_ 


s 4 


m bordy(on) —(p— 1) sm(14# —) ~(p= 
Go 


Since m < a + 1, the right-hand side is strictly smaller than 


(£41) (1+ )-w-n _ 3-@-2@-2 
q (ime 


q-1 


and is <0 because we have g > 3 and p > 5. 
It follows that the absolute value of the nonzero integer Dz is strictly smaller 
than 1. This contradiction proves the theorem. 
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In terms of the description of Runge’s method given in chapter 5, the algebraic 
curve C that is relevant for the proof of Cassels’ theorem is given by the affine 
equation 


¥? =(1+ A2)?? —1 = A? + pA?) +... + pA. 


By Gauss’s lemma, the integral points P = (a, y) € C(Q) are precisely the rational 
points at which the function A assumes integral values. The pole divisor of A is the 
divisor of infinite points of C. Setting W = Y/A”, we have 


P P 

Wie leg tia? agape 
and we see that the pole divisor of A is given by the equation W! = 1. Therefore, 
A admits, up to Galois conjugacy, two poles: The pole Q’ is given by W = 1 and 
is defined over Q, while the pole Q consists of the Galois conjugates of the point 
given by W = ¢,, where ¢, denotes a primitive Qth root of unity. The pole g is 
defined over the cyclotomic field Q(¢,). Since Q is not defined over R, the rational 
points on the curve C are all “far away” from it. The discussion above implies that 
the function 


1 
Z = A™-Py — AIF, (=) 
Aq 


admits no poles outside Q. Indeed, since mg — p > 0, the function Z is a polyno- 
mial in the functions A and Y. Since both A and Y do not have any poles outside Q 
and Q’, neither does Z. The estimate shows that Z(P) tends to zero when P tends 
to Q’. Therefore, Q’ is not a pole of Z either. The function Z is integral over Q[A] 
but not over Z[A]. This is caused by the denominators of the coefficients of F,,(t). 
The function Z’ = DZ is integral over the polynomial ring Z[A]. Therefore, it 
takes integral values in the integral points in C(Q). As we have seen in the proof of 
Theorem 6.4, this leads to the determination of all integral points. 

An essential point of Cassels’ proof is that in the application of Runge’s method, 
one is able to do the estimates uniformly in the exponents p and q. 


Corollary 6.5 Let p,q be odd primes and let x,y be nonzero integers satisfying 
Catalan’s equation x? — y4 = 1. Then 


(i) there exist a, v € Z such that 


P| 
y=pav, x—1=pt'a!, 7 = ss 
(ii) there exist b,u € Z such that 
7+] 
x=qbu, y+l=q?'b?, aie = qu’; 
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(iii) we have 
|x| = max(p?~' —1,q?"'4+ 4), |y| = max(q?~' — 1, p?7'4 p). 


Proof Since p divides y, we have x? = | (mod p) and hence x = | (mod p). Since 
p? divides x? — 1 and since (x? — 1)/(x — 1) = p (mod p’) by Exercise 6.2, we 
see that p?—! divides x — 1. An application of Exercise 2.3 to the factorization 


x—-1 xP-1 -(2) 

pk. “pos Np 
implies (i). Part (ii) follows by symmetry. It suffices to replace the quadruple 
(x, y, p,q) by (—y, —x, q, p). Note that both u and v are positive. For part (iii), 
note that a # 0, so that |x| > p?~! — 1 by part (i). If p < q, Exercise 6.3 implies 
that this bound exceeds g?~! + q and we are done. In the other case, Proposition 6.2 
(ii) implies |x| > q?~' + q, as required. The inequalities for y follow by the 
symmetry (x, y, p,g) @ (—y, —x,q, Pp). 


For a nonzero solution x, y € Z to Catalan’s equation, Corollary 6.5 provides 
us with three pieces of “local” information concerning a nonzero solution x,y to 
Catalan’s equation x? — y? = 1. We have 


e x =1(mod p?—') and y = 0 (mod p), (p-adic), 
e x =0(mod g) and y=—1 (mod q?"'), — (q-adic), 
e |x|, ly) = max(q?~! —1, pa! — 1), (Archimedean). 


The inequalities of part (iii) of Corollary 6.5. can rather easily be improved upon. 
This was done by S. Hyyr6 in 1964. See [18] or Exercise 6.6. 


Exercises 


6.1 Let b € R,,. Show that t + (b' + 1)!/" is a decreasing function from Ryo 
to Ryo. Show that t + (b' — 1)!" is an increasing function from Ryo to Ryo. 


6.2 Let g be an odd prime. 


(a) Leta € Zbe different from 1. Show that (a7—1)/(a—1) = q (mod (a—1)). 
(b) Leta € Z satisfy a = 1 (mod q). Show that (a4 —1)/(a—1) = q (mod q’). 


6.3 Let p > q be odd primes. Show that g?~! — 1 > p?-! + p. 
6.4 Let g be prime. Consider the equation 


yi 
y+1 


yet Sey? 4 Sp Jeg 
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6.5 
6.6 
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(a) For g => 5, show that the only solution to y € Zis given by y = —1. 
(b) Determine all solutions y € Z for g = 3. 
For any two real numbers p, x > 1, show that (x + 1)? -(~—1)? > 1. 


(S. Hyyré [18]) Suppose that p,q are odd primes and that x, y € Z satisfy 
x? — y? = 1. Corollary 6.5 says that there exist a, v € Z such that y = pav 
and x — 1 = p%~'a’. Similarly, there exist b, u € Z such that x = qbu and 
y+1l=q?!b?. 


(a) Show that a = —1 (mod q) and b = 1 (mod p). 
(b) Show that a 4 —1 and b ¥ 1. Conclude that |a| > g — 1 and |b] > p — 1. 
(c) Show 


Ix] > pT '(q-D1-1, 
lyi=q?-(p—1)? -1. 


7 
An Obstruction Group 


Nonzero solutions to Catalan’s equation x” — y? = | naturally give rise to elements 
in certain obstruction groups. Mihailescu’s theorems can be viewed as results about 
the size of the submodules generated by these elements. In this section, we introduce 
the notions and notations that are used in the remaining sections. 

Let p > 2 be prime, let ¢,, denote a primitive root of unity, and let Q(¢,,) denote 
the pth cyclotomic field. The Galois group G of Q(¢,) over Q is given by 


G = {o,: a € (Z/pZ)’}, 


where o, acts on ¢, by raising it to the ath power. Note that o_, is the automor- 
phism that, for every embedding ¢ : Q(¢,) — C, is induced by complex conjuga- 
tion. More precisely, we have $(o_;(x)) = o(x) for every embedding ¢ and every 
number x € Q(¢,). We usually write . for the element o_; € G. By Exercise 7.2, the 
ring Z[¢,] is the ring of integers of Q(¢,). Its ideal class group is denoted by Clp. 
The order of the finite group CI, is called the class number of Q(¢,). It is denoted 
by h,. We define the group E,, of p-units by 


Ep = Zf,, 51". 


By Exercise 7.2, the unique prime p lying over p in the ring Z[¢,] is generated 
by 1 — ¢,. The group E, is generated by the unit group Z[¢p]* and by the 
element | — ¢,,. All groups defined so far are modules over the group ring Z[G]. By 
Exercise 7.3, there is an exact sequence of Z[G]-modules 


ordy 


0 — Z[¢,]" > E,—>Z > 0, 


where ord, is the valuation associated to the prime ideal p. We sometimes use 
exponential notation for the action of Z[G] on multiplicative groups. 
Let g be an odd prime not equal to p. Consider the F,[G]-module 


H= {a € Q(¢,)* : ord,(a) = 0 (mod gq) for all primes t 4 p} /QG,)*. 
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In other words, an element a € Q(¢,)* (modulo gth powers) is contained in H if and 
only if the principal ideal generated by w is equal to a7p* for some k € Z and some 
fractional ideal a. The F,[G]-module His the obstruction group that was mentioned 
in chapter 1. It plays an important role in our presentation of Mihailescu’s proof. 

There is a natural homomorphism yy from H to the group Cl,[g] of ideal classes 
that are annihilated by g. It sends a to the class of the fractional ideal a for which 
(a) = at p* for some k € Z. 


Lemma 7.1 Let p, q be distinct odd prime numbers and let H be the F|G|-module 
introduced above. Then we have the following. 


(i) There is a natural exact sequence of F,{G]-modules 
0 —> E,/Et — H 5 Clylq] — 0. 


(ii) The automorphism « acts trivially on the group E/E}. 


Proof Since the prime p over p is a principal Z[¢, ]-ideal, the map y is well defined. 
It is easy to see that the natural map E,,/E% —> H is injective and that its image is 
contained in the kernel of y. Any ideal a witha’ = (@) is the image under y ofa € H. 
This shows that y is surjective. Finally, if (a) = p* for some k € Z, the number a is 
necessarily a p-unit. This shows that the sequence is exact. This proves (i). 

For any unit u € Z[¢,]* and every embedding ¢ : Qe) — C, the complex 
number (Zp) = o(u) /olu) has absolute value |. Exercise 7.1 implies therefore 


1- ; 7 bac, 
that wu! = re is a root of unity. Similarly, 


1-u 1-6, _ 1-o, _ 
d—-¢,) "= = =j = —$p 
1=é,)  1—f5 
is a root of unity. The group E, is generated by Z[¢,]* and by | — ¢,,. Therefore, 
any € € E, has the property that ¢'~ is a root of unity. Since the roots of unity in 
Qe p) have order dividing 2:, they are gth powers. This shows that the class of ¢ in 
the quotient group E,,/E% is c-invariant. This proves (ii). 


The following proposition explains the relevance of the group H with respect to 
Catalan’s conjecture. 


Proposition 7.2 Let p, q be distinct odd primes and suppose that x, y are nonzero 
integers that satisfy Catalan’s equation x” — y4 = 1. Then the class of the element 
x — ¢, modulo OS is contained in the obstruction group H. 


Proof We have 


[[@-9=5'. 


CEMp 
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Here j1, denotes the group of pth roots of unity. Since the right-hand side is a gth 
power, so is the left-hand side. The gcd of any two of the factors x — ¢ divides the 
prime ideal p = (1 — ¢,) of Z(¢,,)- Therefore, we have ord,(x — ¢,) = 0 (mod gq) 
for all primes t # p of Z[¢,,]. 

This proves the proposition. 


Several of Mihdailescu’s results translate readily into properties of the element 
x — ¢, in the obstruction group H. This is also true for some of the results obtained 
earlier. For instance, an important consequence of Cassels’ theorem can be trans- 
lated into a statement regarding the group H. 


Proposition 7.3. Let x, y € Z be a nonzero solution to Catalan’s equation. Then 
the element x — ¢, of H is not contained in the index-q submodule Ho of H given by 


Hy = {a € Q(¢))* : orde(a) = 0 (mod gq) for all primes t} /Q(¢p)*". 


Proof Corollary 6.5 of Cassels’ result implies x = 1 (mod p). Equivalently, it says 
that x — ¢, is divisible by p = (1 — ¢,). Since p divides x — ¢, exactly once, this 
implies that 


ord,(x — ¢,) 4 0 (mod q). 


This proves the proposition. 


Next we exploit the action of complex conjugation 1 € G on the obstruction 
group H. 

For any Z[G]-module M, we define its plus part Mt by M* = M!*' = {(14u)m: 
m € M} and its minus part M~ by M7 = M'!~ = {(1—0m:m € M}. Both M+ 
and M~ are Z[G]-modules. Writing Gt for the quotient group G/(z), the plus part 
M* is even a module over Z[G*]. When q is an odd prime number and M is a 
Z[G]-module, then M[g] = {m € M : gm = O} is also a Z[G]-module. We have 
M{q]* = Mt[q] and M[q]” = M7[q]. Since gq is odd, any F,[G]-module M 
is the direct product of M~ and M*. In particular, the obstruction group H is an 
F,[G]-module and is a direct product of its plus part H* and its minus part H~. 

The fixed field of « is Q(¢7), where ¢> denotes ¢, + c = 6, + USp). The 
quotient group Gt = G/(:) is the Galois group of Q(é as ) over Q. We denote the 
class group of the ring of integers of Q(é Fl yby C ne The minus class group Cl; is 
the quotient of the class group Cl, of Q(¢,,) by the image of the natural homomor- 
phism Cl} — Cl). 

There is a slight ambiguity in this notation. By Exercise 7.5, the natural map 
Cc ly — Cl), is injective and its image is contained in the plus part of C/,. The 
plus part may contain this image strictly. However, by Exercise 7.6, the index is 
at most a power of 2. Since gq is odd, this implies that the natural homomorphism 
C 5 [gq] —> Cl,[q]* is an isomorphism. Similarly, Exercise 7.7 implies that the 
natural map C/,[qg]~> —> Cl, [q] is an isomorphism. Since we only consider the 
q-parts of Cl, CE: and Cl,, this abuse of notation should not lead to any 
confusion. 
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The order of Cl} is called the plus class number and is denoted by h}. The 
minus class number h7, of Q(¢,) is the order of cl, . By Exercise 7.5, the map 
Cl; —> Cl, is injective. Therefore, we have h, = ane 


Proposition 7.4 Let p, g be two distinct odd prime numbers. 


(i) The natural homomorphism 
H- —>C I {@] 


is an isomorphism; 
(ii) taking plus parts of the modules in the exact sequence of Lemma 7.1, we obtain 
the exact sequence 


0 —> E,/E4 —> Ht —> Cl}[q] — 0. 


Proof This follows from Lemma 7.1. 


By Exercise 7.3, the group E,,/ Ej, is an F,-vector space of dimension (p — 1)/2. 
If q does not divide #G = p — 1, Proposition 13.7 of chapter 13 implies that E,/E% 
is a free F,[G* ]-module of rank 1. This need not be true when q divides p — 1. It 
does not hold for p =1129 and g = 3, for instance [42]. 

It is difficult to say something general about the dimensions of the F,-vector 
spaces cls [q] and Cl; [q]. 


Proposition 7.5 Let p be an odd prime. then 


= 1 
b= 2p| 4 — Buy. 
x odd 


where x runs over the odd characters of G = Gal(Q(¢,)/Q). The generalized 


Bernoulli numbers B,,, are defined by B,,, = ae, px). 


Proof This classical result goes back to E.E. Kummer [23]. Since we do not need 
the result for the proof of Catalan’s conjecture, we merely sketch the proof. See [50, 
Thm. 4.17] for more details. Briefly, the zeta function of Q(¢ ») is a product of the 
L-series L(s, x), where x runs over the characters x : G —> C*. Similarly, the 
zeta function of the subfield Qe ) is a product of the L-series L(s, x), where x 
runs over the characters of Gt or, equivalently, over the even characters of G. Here 
a character x is called even if x (u) = 1. It is called odd otherwise. 

It follows that the product [| x odd L(1, X) is equal to the quotient of the residues 
in s = 1 of the zeta functions of Qé,,) and QE5), respectively. The result now 
follows from the formula for the residue in s = 1 of the zeta function of a number 
field and the fact that for odd characters x we have 


Ti 
Ld, x)= aye 


Here t(x~!) denotes the Gaussian sum — Sar, x(a)~'e27'4/P € C. This completes 


the proof of the proposition. For more details, see [50, Thm. 4.9]. 
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Table 7.1 Minus class numbers h, for p < 100 


p hy p +h, p hy oP hy 
3 1 19 1 43 211 71 3,882,809 
5 1 23 3. 47 695 73 11,957,417 
7 1 29 8 53 4,889 79 100,146,415 
11 1 31 9 59 41,241 83 838,216,959 
13 1 37 37 61 76,301 89 = 13,379,363,737 
17 1 41 121) 67 85,3513 97 411,322,824,001 


By means of this proposition, it is easy to compute the order h7, of the group CI, 
The entries in Table 7.1 were already known to Kummer in 1850. The numbers 
h;, become very large as p grows. In practice, it is also rather easy to compute 
the structure of the group Cl,,. For instance, for p = 29, the class group Cl, is 
isomorphic to (Z/2Z)°, while for p = 31, it is cyclic of order 9. See [41]. 

At present, there are no good methods to compute the orders ht of the plus 
parts es bs of the class groups of Qé,,): For p < 71, the groups Ci are known to 
be trivial. Under assumption of the general Riemann hypothesis, this result can be 
extended to the primes p < 163. However, the group C ms has not been computed 
for a single prime p > 163. It seems that the groups C a are usually small and often 
trivial. 

For any given prime q, however, it is not difficult to compute the order of C ie [q]. 
For instance, for p = 877 and g = 7, the group C es [q] is a two-dimensional vector 
space over F7. See [42] for more information on the groups C Be 


Exercises 


7.1 Let ¢ be an integer in a number field F with the property that |@(e)| = 1 for all 
embeddings ¢ : F <~> C. Show that it is a root of unity. 

7.2 Show that Z[¢,] is the ring of integers of Q(¢,). Show that the Z[¢, ]-ideal p 
generated by 1 — ¢,, is a prime ideal of norm p. See [50, p. 11]. 

7.3 Let p be an odd prime and let E, denote the group of p-units of Q(¢,). 


(a) Show that the natural sequence 


ordy 
0 — Z{¢,]* —> E, — Z— 0 
is exact. Here ord, is the valuation associated to the prime ideal p = (1—¢,,) 
of the ring ALAR 
(b) Show that for every odd prime g # p, the F,-dimension of E,,/E% is equal 
to (p — 1)/2. 


7.4 Using techniques from algebraic number theory [24, Ch.V, s. 4], show directly 
that the class numbers of Q(¢,) are equal to 1 when p = 3,5, or 7. 
7.5 Let p > 2 be prime and let ~ C Q(¢,)* denote the subgroup of roots of unity. 


(a) Consider the homomorphism 


ken Cl > Clip => p/p 
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that maps the class of a Qe; )-ideal J for which the Z[¢, ]-ideal generated 
by J is equal to (a), to a!~'. Show that it is well defined and injective. 

(b) Show that the map Zé)" —> pu given by ub u'~ is well defined. Show 
that its image is ju. 

(c) Show that the natural map C a —> Cl) is injective. 


Let p > 2 be prime. Show that the image W of the natural map C es — Cl, 
is invariant under 1. Show that the quotient of the group of t-invariants of Cl,, 
by the subgroup W, is annihilated by 2. 

Let p be a prime. Show that the kernel and cokernel of the natural map 
C ce — Cl, — CI, are finite 2-groups. Show the same for the natural 
map C/,[1 + +] —> Cl, — Cr. Here C/,,[1 + ¢] denotes the kernel of the 
norm map given by x +> x(x) for all x € Clp. 

Let Ho be the subgroup of the obstruction group H that is defined by Hp = {a € 
H : ordy(a) = 0 (mod g)}. Compute the index [H : Ao]. 


8 
Small p or q 


In this chapter, we present Mihailescu’s proof of Theorem IV mentioned in chapter 1. 
The proof is by a p-adic argument. Let p, g be distinct odd primes. In Chapter 7, we 
introduced the obstruction group 


H= {a € Q(¢,)* : ord,(a) = 0 (mod gq) for all primes t 4 p} ING. 


By Exercise 7.2, the number z = ¢, — | generates the unique prime divisor of p in 
the ring Z[¢,]. We have (t)?-! = (p). Recall that. € G = Gal(Q(é,,)/Q) denotes 
complex conjugation. 

In Chapter 7, we associated the element x — ¢, of H to a nonzero solution x, y 
of Catalan’s equation x”? — y? = 1. In this chapter, we show by means of a z-adic 
argument that the minus component (x — ¢ Ae of x — Sy is a nontrivial element 
of H. This is the content of Theorem 8.3. We show that it easily implies Theorem IV. 
A result similar to, but weaker than, Theorem 8.3 was obtained by Y. Bugeaud and 
G. Hanrot [6]. 

Recall that saying that the element (x — ¢ Pa of H~ C AH is trivial means that 


(x — ee agi = a4 for some a € Q(¢,)* or, equivalently, 
ao op = at, 
x — Up) 


for some a € Q(é,)*. Since Q(¢,) does not contain any gth roots of unity, the 
element @ is unique. 


Definition Let x, y € Z be nonzero integers satisfying x? — y4 = 1. Suppose that 


(x — ey = a! for some a € Q(é,)*. Let w € Q be a qth root of me and set 
P 


w’ = w/a. Then w' is a qth root of See, Set 
P 


n=(w-w')!, 


The number 7 is associated to our nonzero solution of Catalan’s equation. Note that 
it does not depend on the choice of the gth root of w. Even though 7 is eventually 
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shown not to exist, the analysis of its arithmetic properties plays a central role in our 
proof of Theorem 8.3. 


Proposition 8.1 Let p, g be two distinct odd primes. Suppose that x,y € Z 
are nonzero integers satisfying x? — y4 = | and that (x — a ae = a! for 
some a € Q(f,)*. Then 


(i) we have a = —1 (mod z), 
(ii) the number n defined above is a unit of the ring Zé], 
(iii) the norm from Q,,) to Q of n is equal to 1. 


Proof (i) By Corollary 6.5, we have x = 1 (mod p). It follows that 2 divides x—¢,,. 


Moreover, the quotient tf = — = acl — 1 is invertible modulo z. Since 
ua) =—-1+ ee = es we have 
uA 
af = t!- mii —t!-' (mod x). 
—op aM 


Since p is totally ramified in the extension Q(¢,) of Q, the automorphism acts 
trivially modulo zz. It follows that t‘~! = 1 (mod z) and hence e? is congruent to 
—1 (mod z). 

On the other hand, multiplying the equation (x — ¢ ve = a! by its complex 
conjugate, we see that (av(a@))? = 1. Since Q(¢ oe does not contain any nontrivial gth 
roots of unity, this implies that ai(@) = | and hence a? = 1(modz). Combining 
this with the congruence a7 = —1 (mod 7), it follows that a = —1 (mod 7), as 
required. 


(ii) From the equality 


1\¢4 
n= (w-w yf = wt (1-=) 


- by) ND) oy 1), 
1-¢, x-%, 


it follows that n is contained in Q(¢,). By Corollary 6.5, the number x is congruent 


i is integral. Therefore, w is integral as well. 

p 

Similarly, the fact that we have w’? = (w/a)? = ae = —.(w’¢,) implies that 
P 

w’ is integral. Therefore, 7 is integral and hence is in Z[¢ pl. To see that it is a unit, 


we observe that w — w’ divides 


to 1 modulo p?—!. This implies that 


1q x—Cp x— (Sp) =Cp-t Up) Paes 
— = = + 1, 
a a a 


wt 


which is a unit by Exercise 8.1. Therefore, w — w’ and 7 = (w — w’)? are units as 
well. 
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(iii) Since 7 is a unit, the norm N(7) of n, taken from Q(¢ ) to Q, is equal to a 
unit in Z. In other words, we have N(7) = +1. We have the plus sign because the 
cyclotomic field Q(¢,) is complex. 

This proves the proposition. 


Proposition 8.2 Let p, q be two distinct odd primes. Suppose that x,y € Z are 
nonzero integers satisfying x? — y4 = 1 and suppose that (x — cae = a! for 
some a € Q(f,)*. Then we have 


q = 1 (mod p). 


Proof We compute a z-adic approximation to the norm of 7 and confront this with 
the fact that N(7) = 1. 

Since the degree of Q,(¢,) over the field of p-adic numbers Q, is equal to 
[Q(ép) : Q] = p — 1, we can compute the norm of 7 locally, i.e., from Q,(fp) 
to Q,. Let 


x—-1 


1-6, 


LL — 
The number ju is related to the numbers w and w’. We have 
w= ig and wt =—u(e,)(1 +). 


Recall that the unit 7 = (w — w’)? does not depend on the choice of the gth root w 


of re . We choose w € Q,(¢,) to be the value of the Taylor series for the gth root 


of = = 1+ py: we set 


I-¢p 
w=VYltu=) > ("") in Qy(Sp). 


j=0 


By Corollary 6.5, we have x = 1 (mod p?~!), so that 4 =0 (mod p?~”). Exercise 8.2 
implies therefore that the Taylor series does indeed converge to a number in Q,,(¢)). 


Claim For w' = w/a in Q,,,) we have 
(1/4 
wos — V1 +d) = -0 >> ( j Jeu 
j=0 
Herer € Z satisfies rq = —1 (mod p). 


Proof of the claim If p ¥ 1 (mod q), there are no primitive gth roots of unity 
in Q,(¢,). Therefore, there is a unique w’ € Q,(¢)) satisfying w’ = —U(Cp) 
(1 + e(z)). It can only be the gth root given by the Taylor series. 
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To see that this is also true when p = 1 (mod q), we observe that w’ = w/a = 
—1 (mod zr). The Taylor series expression for the gth root of the number —1(¢,) 
(1 + e(2)) is also congruent to —1 (mod zr). Since w’ is equal to a qth root of unity 
€ times the value of the Taylor series, we conclude that € = 1. It follows that w’ is 
necessarily equal to the number given by the Taylor series. This proves the claim. 

Next we use the Taylor series expression to compute the norm of 7. The gth 
power of u = w — w’ is equal to 7, so that N(7) = N(u)?. We perform our compu- 
tation in the p-adic ring Z,[¢,]. The maximal ideal of this local ring is generated 
by z. 

We do the calculation modulo *Z plé,]. Expanding the first few terms of the 
Taylor series for the gth roots, we find that 


u=w-w=V¥1ltetgvl+up) 
- (+2) +e (1+) (mod 122) 
q q 


_ r+l 
d Sp (mod 1”). 
(1 —¢,)U + 67) 


=a tap (et 
q 


By Exercise 8.1, the number ¢ + 1 is a unit of the ring Z[¢,,]. Therefore, its norm 
is equal to | and the norm of u is given by 


x—1 1—¢rt! ) i 
N(u) = 1+ d 
- TI( 7, ey ee 
c#l 
x—1 fc ‘ 
=l+ d : 
q 2 mise ee 
cAl 


The summand corresponding to ¢, is equal to 


1 al 7 ease nyt! 
(l—¢,) +67) -2 +42) 
—(r + la + O(n?) 
~ =n(2+r2) + O(73) 


1 
i (mod zr), 


where O(z*) denotes a term that is divisible by *. Since the Galois group of 
Q,(fp) over Q, preserves the ideal generated by zr, it follows that 


Loe sett, “i ; : 
(el eer 72 mod 7 or every € fy, 6 #1. 
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Since the number ret does not depend on ¢, we find that 


N@=1+~ 7 ap der ae OD ee aes Tey 


Here we use that (x — 1) divides x? = ((x — 1) /n), which in turn follows from 
the fact that x = 1 (mod p%~'). Since rg = —1 (mod p), we have 


N(n) = NUuyt = 14+ (% 54 (mod (x — 1)z). 


Since N(7n) = 1, this implies that wz and hence p divide 1 — q. 
This proves the proposition. 


Theorem 8.3 Suppose that p, q are distinct odd primes and that x, y are nonzero 
integers satisfying Catalan’s equation x? — y4 = 1. Then the minus component 
(x — a of x — ¢,, is a nontrivial element in the obstruction group H. 


Proof Suppose that (x — ¢ eam is trivial in H. Then we have 


x—Cp cuts 


x—USp) 


for some a € Q(¢,)*. By Proposition 8.2, we have g = 1 (mod p). 
We consider once again the unit 


w= J1+u+ Eyl + cu) 


that was defined above. We compute the norm of 7 = uw‘ using Taylor series 
expansions. This time we do our calculation modulo ju*. Since gq = 1 (mod p) 
and rg = —1 (mod p), we have r = —1 (mod p) and we see that the linear term 
+ ¢, (2) in the Taylor series expansion of u vanishes. 


We compute the degree 2 terms of the Taylor series of the gth roots. Recall that 
w= a so that 1(44) = 6p. We find 


u=w-w = Vl ++ oryl + uu) 
1 
1+ (a2 east + (fan?) nod 1 


2 
= 1465 (14+ fe — 122, ) (mod 1") 
‘i 2q? (1 —,) 
The norm of u is equal to 
es Lg 4x2 4 3 
N(u)=1+ oa (x — 1) pe aoe (mod 11°). 
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By Exercise 8.2, the sum of the series is equal to iy It follows that 


me 1- 
4 —1p—! 
24 12 
Note that (x — 1)? /12 is a w-adic integer. 
Since N(n) = 1, the number yp? = ((x — 1)/s)* divides raed - Ive. 


1 2 
N(n) = Nut = 1+ (mod 1°). 


“2 Lee in Z,[¢,,]. Since Corollary 6.5 says that 


x = 1 (mod p?~!), it follows that p?—! divides (1 — qe. Since 23/3 divides p, 
it follows that p?~! divides (¢ — 1)p and hence that p%~? divides g — 1. This is 
impossible since p?~? is larger than g — 1. 

This contradiction proves the theorem. 


This implies that x — | divides 


Corollary 8.4 Let p, q be odd primes and suppose that q does not divide h, or 
that p does not divide h;. Then Catalan’s equation x? — y4 = | has no nonzero 
solutions x,y € Z. 


Proof By symmetry, it suffices to consider the case where g does not divide h>. 
By the remarks at the end of Section 7, we have H~ = CI [gq], so that the minus 
part H~ of the obstruction group H is trivial. However, by Theorem 8.3, a nonzero 
solution to Catalan’s equation gives rise to the nontrivial element (x — a in. 
It follows that there are no nonzero solutions to Catalan’s equation. 


Theorem IV of Chapter | is an immediate consequence: 


Theorem IV [/f p, g are odd primes and either p or q does not exceed 41, then 
Catalan’s equation x? — y4 = | admits no nonzero solutions x, y € Z. 


Proof By Lemma 6.1, we have p #q. By symmetry, we may assume that p <q. 
From the table of minus class numbers (Table 7.1), we see the following. When 
p < 19, the minus class number h7, is equal to 1, so that g does not divide h, 
for any prime q. For these primes, the theorem is clear. An inspection of Table 7.1 
furthermore shows that for p = 23, 29, 31, 37, and 41, the minus class number he 
is equal to 3, 8, 37, 37, and 117, respectively. In each case, the prime divisors of h, 
are at most p and once again q, being larger than p, cannot divide h7,. Corollary 8.4 
therefore implies the theorem. 


With some more effort, one can extend Theorem IV to p = 43. See Exercise 8.5. 
However, for p = 47, this approach fails. The minus class numbers of Q(¢,,) and 


Q(¢139) are given by 
hy = 5-139, 
hig = 3°. 47-277? - 967 - 1188961909. 
It follows that when g = 139, we have that p divides h7 as well as q divides h7,. 


For the purpose of proving Theorems II and III in Chapters 11 and 12, we only 
need to know that Catalan’s equation x? — y? = 1 has no solution in nonzero 
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x, y € Zwhen one of p, q is at most 5. In order to deduce this from Corollary 8.4, it 
suffices to know that one hash, = | for p = 3 and 5. This can easily be established 
with standard techniques from algebraic number theory. See Exercise 7.4. 


Exercises 


8.1 Let p be an odd prime. Show that 1 + ¢, is a unit of the ring Z[Z,]. 
8.2 Leta €Q. 


(a) Show that the series }°,..o (eye converges for t € R with |t| < 1. Show 
that the sum is equal to (1 + t)®. 

(b) Let p be a prime number and suppose that a = r/s with r,s € Z and 
s # 0 (mod p). Show that the series a, Gye converges fort € pZ, 
and show that the sth power of the sum is equal to (1 + 1)’. 


8.3 Check that the double Wieferich criterion of Chapter 10 holds for the pair p = 
47 and gq = 139 and conclude that Catalan’s equation has no nonzero solutions 
for these exponents. 

8.4 Let n be a natural number and let jz, denote the group of nth roots of unity 
in C*. For every k > 1, we lets, = >° ao: We have s; € Q. 


CEMn 
cA1 


(a) Show the following equalities of power series in Q[[X]]: 


k 
ys =e | Ta- => 


k>1 CEMn 
cAl 


ige (S — Xj" — -) 
—nX 
n—1 (n — 1)(n —5) 
2 24 


ROC): 


(b) Show that one has 


Bre 7 
= $2 sy= . 
_ ry 
aah (1-4) 12 
cl 
8.5 Extend Theorem IV to p = 43: 


(a) show that hj, = 211; 
(b) show that 


hy = 37.77. 41-71 - 181-2817 - 421 - 1051 - 12251 - 113981701 - 4343510221, 


and conclude that the hypothesis of Corollary 8.4 still holds. 


9 
The Stickelberger Ideal 


Let p be an odd prime and let ¢, denote a primitive pth root of unity. Let 
G= Gal(Q(é,,)/Q). We have G = {oy : a € (Z/pZ)*}, where o, is defined 
by oa(Sp) = ¢;. The Stickelberger ideal is an ideal of the group ring Z[G]. In 
this chapter, we establish some of its basic properties. See also [25] and [50]. We 
determine the Z-rank of the Stickelberger ideal. This is the content of Theorem 9.3. 
The most interesting property of the Stickelberger ideal is that it annihilates the ideal 
class group of Q(¢,). This is the content of Theorem 9.6. 


Definition The Stickelberger ideal is the ideal of Z[G] generated by the elements 


Recall that [x] denotes the integral part of x € R. The Stickelberger element is 
6, = S?-\aoz!. The G-trace is the element T = ©?7/0, of Z[G]. 


Lemma 9.1 Let p be an odd prime and let G = Gal(Q(é,,)/Q). Then the following 
relations hold in the group ring Z[G]: 

(i) On4i = 9; + Oy for everyi € Z; 

(ii) AG —i)0p = —6; for every i # 0 (mod p); 


(iii) 0; + Op-i = 9p — T for every i 0 (mod p); 
(iv) for every j #0 (mod p), we have 


0j0,) = —p0; + j@p, 
09} = Gj — 10; foralli # 0 (mod DP). 


Proof Parts (i) and (ii) are immediate from the definition of 6;. Note that for every 
i #0 (mod p), the coefficients of (o; — i)@, are divisible by p. For (iii), note that 


pol : (p —i)a pol 
6 +41 = 3 (|| Ae je) o'= xc —o;'=6,-T 


a=1 e P a=1 
for every i prime to p. Part (iv) is also easily verified. 
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Definition Using the notation of Lemma 9.1, we define for i € Z the following 
elements in the ring Z[G]. Recall that t = o_. 


Si = G41 — G3, 
e=U-dfi. 


Proposition 9.2 As a group, the Stickelberger ideal is generated by the G-trace T 
and by the elements f; for 1 <i < (p— 1)/2. 


Proof Let J be the subgroup of Z[G] generated by the elements f),..., f, pol and 
by the G-trace T. Since 6; = 0, the group J is also generated by T and by the 
elements 62,..., @0+1. From Lemma 9.1 (iii), it follows that T and hence all of J is 
contained in the Stickelberger ideal. Conversely, it suffices to show that the elements 
in J generate the Stickelberger ideal as a group. Taking i = bee in the relation of 
Lemma 9.1 (iii), we see that 6, — T € J. This implies that 6, ¢ J. The same 
formula implies that 6; € J for pot <i < p-—1.It follows from Lemma 9.1 (iii) 
that J contains the elements 6), 62, 63,...,9,. The periodicity relation of 
Lemma 9.1 (i) implies then that we have 6; € J for every i € Z. Since the 6; 
generate the Stickelberger ideal over Z[G], it suffices now to show that J itself is a 
Z[G]-ideal. This follows at once from the relations in Lemma 9.1 (iv). 


Theorem 9.3 Let p be an odd prime number and G = Gal(Q(¢é mh Q). 


(i) The elements f;, for 1 <i < Bo together with the G-trace T form a Z-basis 
for the Stickelberger ideal of Z[G]. 
(ii) Let I be the ideal of Z[G] that is the product of the Stickelberger ideal by (1 —1). 


The elements e;, for 1 <i < pol form a Z-basis for I. 


Proof By Proposition 9.2, the elements f;, for 1 <i < po together with the 
G-trace T generate the Stickelberger ideal as a group. In order to show that the 
generators are independent, it suffices to show that the Z-rank of the Stickelberger 


ideal is equal to the number of generators aa Similarly, since (1 — 1)T = 0, the 
elements e; = (1 — fj, forl <i < po generate J as a group and, in order 
to prove (ii), it suffices to show that the Z-rank of J is equal to bot The Z-ranks 
of these ideals are equal to the C-dimension of the ideals generated by the same 
elements in the group ring C[G]. We tensor the group ring therefore with C. It 
follows from the formulas of Lemma 9.1 (i) and (ii) that the ideal generated by 
the Stickelberger ideal inside C[G] is generated by the Stickelberger element 0). 
Similarly, the ideal generated by / inside C[G] is generated by (1 — 1)@p. 

For every character x : G —> C*, we let x : C[G] —> C denote the C-algebra 
homomorphism induced by x. In other words, for 9 = Loegxza € C[G], we 
set x(0) = YoeGXo X(o). Taking the product over all characters x, we obtain the 
C-algebra isomorphism 


j:cie] — [Jc 
x 


which is given by j(@) = (x(@)),. 
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Consider the element 0, = 2 eS aoe ' of the Stickelberger ideal. The isomor- 
phism j maps it to the vector (x (Op) , + When x is an odd character, i.e., when 
x(t) = —1, we have 


p-l 


Xp) = Yo ax(oa) | = pBiy. 


a=1 


Here B,,, is the generalized Bernoulli number that is defined in chapter 7. By Propo- 
sition 7.5, the product over the odd characters x of —5By, x is equal to hy /2p. In 
particular, it is not zero. It follows that B,,, and hence x (@,) are not zero for each 
of the _ odd characters x. 

When x is even, we have that 


(p-1)/2 (p-1)/2 


XO) = Yo (at+(p—a))xay! =p D> xa). 


a=1 a=1 


The latter expression is 0 when x ¥ 1, while it is equal to p(p — 1)/2 for x = 1. 
It follows that x(0,) 4 0 for precisely aa different characters of G. Therefore, 
the C[G]-ideal generated by the Stickelberger ideal inside C[G] has dimension Be 


This implies that the eel elements of the Stickelberger ideal are independent, as 


required. 

Similarly, since x (1 — ¢) is zero if x is an even character of the Galois group G, 
while it is equal to 2 when x is odd, we have x((1 — 1)0,) € O precisely for the 
a odd characters of G. It follows that the C[G]-ideal generated by the Z[G]-ideal 


TI inside C[G] has dimension po This implies that the Bo elements e; of J are 
independent, as required. 
This proves the theorem. 


Fora,i € Z, we set njg = [F] and mj.q = Ni+1,a—Ni,a- Then we have, fori € Z, 


p-l 
“A 
fi =G441-G = y MiaFg - 


a=1 
For a,i € Z, we define the coefficients u;q by 
p-l 


e¢=(1-)f, =) ujaoz'. 


a=1 
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Proposition 9.4 Using the notation introduced above, exactly (p — 1)/2 of the 
coefficients ujg With 1 < a < p — 1, are equal to I while the other (p — 1)/2 are 
equal to —1. There is a symmetry given by ep_\_-; = @ for 1 <i < (p— 1)/2 


Proof Since mj,q = = [tbe el oe [/4], it follows from Exercise 9.3 that the coefficients 
Mj,q are either O or 1. By Exercise 9.4, the element (1 + 1) f; is equal to the G-trace 
T = Xoeco. Therefore, the sum mj,q + mj,p—-a is always equal to 1. It follows 
that exactly one of mj,q, Mj,p—a 18 zero, while the other is 1. In particular, exactly 
(p — 1)/2 of the coefficients m;,, are 0 and the other (p — 1)/2 are equal to 1. Since 


we have e; = (1 — J) f; for| <i < P=! We see that exactly one of the coefficients 


Ui,a» Ui,p—a 18 1, while the other is Sana to —-1. 

Finally, it is easy to see that there is a symmetry given by f,-;-; = fj for 
0 <i < (p — 1)/2, and, therefore, there is a symmetry given by ep,_)_; = e;. This 
proves the proposition. 

We give a numerical example. Let p = 7. Tables 9.1—9.3 list the elements 6;, fi, 


and e; fori = 1,..., 6. Recall that fe_; = f; and e¢_; = e; in this range. 


Table 9.1 Elements 6; = )>?~ ; NiaO, | 


i oni N22 Nig Nia Nis Nie 
1 O 0 0 0 0 0 
2 O 0 0 1 1 1 
3 0 0 1 1 2 2, 
4 0 1 1 2 2 3 
5 0 1 2 2 3 4 
6 0 1 2 3 4 5 


Table 9.2 Elements f; = 6;,; — 6; = bs 1 MNj,aFq 


i mat M2 M3 Mia Mis Mis 
1 O 0 0 1 1 1 
2 0O 0 1 0 1 1 
3 0 1 0 1 0 1 


Table 9.3 Elements e; = (1 — 0) fp = 0? uiaoz! 
io oui Ui2 U3 Uia Us Ui 
1 ef? set 221 1 eal 
Or ay M4 1 <4 1 ot 
So = 1 1 -1 =i 4 


The following theorem is the key ingredient in the proof of Stickelberger’s 
theorem. In his 1890 paper, L. Stickelberger [47] proved his result for arbitrary 
cyclotomic fields. We prove it here only for the cyclotomic fields Q(¢,). In that 
case, the result was already obtained by E.E. Kummer [23] in 1847. 
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Fig. 9.1 Ernst Eduard Kummer (1810-1893) 


Theorem 9.5 Let p be a prime and let | be a prime ideal of the ring Z{¢,,| of 
degree 1. Then (° is principal for every element 0 of the Stickelberger ideal. 

Proof Set F = Q(¢,) and let G = Gal(F'/Q). As before, we write G = {o, : a € 
(Z/pZ)*}, where oa(¢,) = ee Let [ be a prime ideal of Z[¢,,] of degree 1. Its residue 
field is F,, where / is a prime congruent to | (mod p). Set A = Gal(Q(¢,)/Q). The 
restriction maps induce an isomorphism Gal(F(¢;)/Q) —> G x A 


Fé) 


F QG) 
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We let G act on F(¢;) by setting o,(¢;) = ¢. Similarly, we write 
A= {p-:¢ € (Z/1Z)"}, 


where p,(¢;) = ¢/, and we extend the action of A to F(¢;) by setting p.(fp) = ae 
We have G = Gal(F(¢;)/Q(¢))) and A = Gal(F(¢))/F). 

We fix a primitive /th root of unity ¢ and a nontrivial character xy from (Z/1Z)* 
to the group of pth roots of unity yz, in Z[¢,]*. Then we define the Gaussian 
sum tT by 


C= ye XG. 


x(mod /) 


The Gaussian sum Tt is contained in the ring Z[¢,,, ¢;]. By Exercise 9.6, we have 
p(t) =x ‘(c)t foreach p, € A. 


This shows that t? is A-invariant and is hence contained in the p-th cyclotomic 
field F. The subfield F(t) of F(¢;) is a Kummer extension of F. By Exercise 9.9, the 
Kummer homomorphism 


Gal(F(t)/F) — Hom((t?)F*?/F*?, wp) 


given by o +> g, where g is determined by g(t) = o(t)/T, is an isomorphism of 
G-modules. Here G acts on Gal(F'(t)/F) by conjugation and acts on the right-hand 
group Via 0,(g)(T) = oa(y(o, !(t))) for every og € G. 

Since tT is contained in the cyclotomic field F(¢;), the extension Q C F(t) is 
abelian and the action of G on Gal(F(t)/F) is trivial. It follows that the action by G 
on Hom((t?) F*?/F*?, yp) is also trivial. This means that G acts on (t?) F*? /F*? 
the same way it acts on j,. In other words, we have 


op(t”) = t”” - (pth power) 


for all b € (Z/pZ)*. 
By Exercise 9.6, the Gaussian sum T is an algebraic integer that divides /. There- 
fore, we have the following factorization of the F-ideal generated by t?: 


(?) = [J oto" 


04EG 


for certain integers x, satisfying 0 < x, < p. Applying the automorphism oz, 
we find 


o,(t?) = lg Opa" = I] o,'()* - (pth power). 


O4EG O,EG 
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Replacing a by ab in the product on the left, we obtain 
Xba = bxq (mod p), foralla, b € (Z/pZ)*. 


Setting a = 1, this gives x, = bx, (mod p) for all b € (Z/pZ)*. We claim that 
x; 4 O (mod p). Indeed, if x, = 0 (mod p), then x, = O (mod p) for every 
b € (Z/pZ)*. This implies that the F-ideal generated by t” is a pth power. It follows 
from Exercise 9.8 that the extension F C F(t) is unramified at /. This cannot be 
true, since F(t) is a degree p subfield of the totally ramified extension F'(¢;) of F. It 
follows that x is invertible modulo p and that the set {x, : a € (Z/pZ)*} is equal to 
{1,2,..., p — 1}. In particular, there is an element a € (Z/pZ)* for which x, = 1. 
We have x,, = b for all b = 1,2,..., p — 1 and hence 


oa(t”) = | 0,00" = [] 9, = [] 9, 100". 


opEeG opEeG opEG 


We find that 


("= o,(t”), 
where 6, € Z[G] is the Stickelberger element 6, = SPs bo, 1 in particular, we 
see that the ideal (° is principal. 

Applying the elements o; — i € Z[G], we find by Lemma 9.1 (ii) that 


(P% = go, (r%-W?P), 


We claim that the element t'~® is contained in F. Indeed, for each c € (Z/IZ)*, 
we have 


(oi-Nbe=1) —. plbe- Wei!) — 


This follows from the fact that t”~! is contained in the group /, of the pth roots of 
unity and that the Z[G]-module jz, is annihilated by o; — i for alli 4 0 (mod p). 

It follows that this is an equality of the pth powers of F-ideals. Since the ideal 
group is torsion-free, we obtain the relation 


Sag), 


Therefore, (° is principal for every element 6 of the Stickelberger ideal, as required. 


The proof of Stickelberger’s theorem is concluded by showing that the ideal class 
group is generated by the classes of the prime ideals of degree 1. The usual proof 
to show this relies on a density theorem and on class field theory. We present a 
different, more elementary proof due to E.E. Kummer [23]. See also [28]. 


Theorem 9.6 Let p be prime number. Then the Stickelberger ideal annihilates the 
ideal class group of Q(é)). 
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Proof By Theorem 9.5, it suffices to show that the ideal class group of Q(¢,) 
is generated by the classes of the primes of degree |. Thus, let q be a prime 
ideal of Z[¢,,] of degree f > 2. By Kummer’s Lemma (Exercise 9.7), we have 
q = (g, h(f,)) for some prime number g ¥ | (mod p) and some irreducible monic 
divisor h(X) of the cyclotomic polynomial ®,(X) € F,[X]. The degree of h is fand 
we have 


Ep . 
h(X) = | [cx - &”) 
j=l 


for some primitive pth root of unity € in F,. It follows that the constant term of H 
is equal to (—1)/ times a pth root of unity. Since f > 2, the only pth root of unity 
contained in F, is 1. Therefore, 


h(0) = (-1). 


Let H(X) € Z[X] denote a monic lift of h(X) with the property that H(0) = (— 1). 
For j = 1,2, we analyze the prime divisors of the elements H(¢,) + qi of Z[Ep]. 
Let t be a prime ideal of degree at least 2. Then it is of the form t = (7, h(¢,)) for 
some prime number r # | (mod p) and some irreducible monic divisor h'(X) of 
the cyclotomic polynomial ®,(X) ¢€ F,[X]. Let f denote the degree of h(X). The 
constant term of A(X) is equal to (— bf. 

The prime v divides H(¢,) + q/ if and only if the polynomial h(X) divides 
H(X)-+q/ in the ring F,[X]. This shows that necessarily f < f.If f = f, the fact 
that both polynomials are monic implies that the polynomials h(X) and H(X) + q/ 
are equal in the ring F,[X]. Inspection of their constant terms shows that r divides 
qi + (-1)f — (-1)! = q/. This implies that r = gq and hence t = q. Finally, 
we observe that q divides either H(¢,) + q or H(fp) + q° exactly once. Indeed, if 
this were not true, then q? would divide the difference g* — q. This is impossible, 
because q is unramified in Q(¢,). 

We conclude that f < f and hence that either the principal Z[¢, ]-ideal generated 
by H(¢,)+q or the one generated by H(¢,)+ q° is equal to the prime ideal q times 
a product of prime ideals of strictly smaller degree. Therefore, the ideal class of q 
is contained in the subgroup of the class group that is generated by prime ideals of 
degree < f. 

It follows by induction that q is in the subgroup of the class group that is generated 
by prime ideals of degree 1. Since q is an arbitrary prime ideal of degree > 2, we 
conclude that the class group is generated by prime ideals of degree 1. Theorem 9.5 
implies now that the Stickelberger ideal annihilates the ideal class group, as required. 


Exercises 


9.1 Let G be a finite abelian group and GY denote its group of characters x: 
G— C. 
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9.2 


9.3 


9.4 


9.5 


9.6 


9.7 


9.8 


9.9 


(a) Show that the map C[G] — Tyeev C defined by 6 +> (x(@))yegv is an 
isomorphism of rings. 

(b) Show that any ideal J of Tyeev C has the form [yea C x Tyga {0} for 
some subset A of GY. 

(c) Let 9 € C[G]. Show that the ideal generated by 6 has dimension m = 
#{x € GY : x(0) A O}. 


Let p be an odd prime and G = Gal(Q(¢,,)/Q). Show that in the ring Z[G] we 
have 6) = 0; = O and 6_; = —T. 

For x, y € R, show that [x] + [vy] < [x + y] < [x] + Ly] + 1. Let p be an odd 
prime and G = Gal(Q(¢,,)/Q). Show that the coefficients m;,q of the elements 
Fi € Z(G] are either 0 or 1. 

For x € R, show that [x] + [—x] = 0 when x € Z, while [x] + [—x] = -1 
otherwise. Let p be an odd prime and let f; = 6;4; — 0; be the elements in the 
group ring Z[G] defined in this chapter. Show that for every 0 <i < (p—1)/2, 
the element (1 + 2) fj is equal to the G-trace T. 

Let p = 11 and G = (Z/pZ)*. Compute the elements 6; € Z[G] for 1 <i < 
p — 1. Compute the elements fj, e; ¢ Z[G] for 1 <i < (p — 1)/2. 

Let / and p be odd primes. Let t(x) be the Gaussian sum associated with the 
character x : (Z/1Z)* —~> jp. Using the notation of this chapter, show that 


(a) the product of t(x) by its complex conjugate is equal to /, 
(b) oa(t(X)) = T(x“) for all a € (Z/pZ)", 
(c) po(t(x)) = x~'(b)t(x) for all b € (Z/1Z)*. 


(Kummer’s lemma). Let f € Z[X] be an irreducible monic polynomial. Let 
a denote a zero of f. Suppose that the ring Z[a] is the ring of integers of the 
number field F = Q(w). For every prime number /, let [].)¢ (moa 8° be 
the factorization of f (mod /) into a product of powers of distinct, irreducible 
polynomials g. Show that 


PS [el es) 


gif (mod 1) 


where [ is the Z[a]-ideal generated by / and g(a). 

Let n € Z> 1. Let F be a number field and a € F*. Show that any ideal p of 
F that is coprime to n is unramified in the extension F C F(./a) if and only if 
ordy(a) = 0 (mod n). 

(Kummer theory) Let K be a field and let n be a positive integer not divisible 
by the characteristic of K. Set E = K(¢,) and G = Gal(E/K). Let A be a 
finitely generated subgroup of E* with the property that the image A/(AN E*") 
is a Z[G]-submodule of E*/E*”. By E (</A) we denote the extension of E 
generated by the nth roots of the elements of A. 


(a) Show that E (x/A) is a finite Galois extension of K containing E = K(¢,) 
as a subfield. 
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(b) Consider the Kummer map E*/E*" —> Hom(Gz, ty) given bya bh f, 
where f(o) = o(x/a)//a for all o € Gg. Show that it is an isomor- 
phism of Z[G]-modules. Here Gg = Gal(E/E) and the G-action on 
Hom(G g, /,) is defined by (tf)(o) = o(f(t~!or)) for anyo € Gg. 

(c) Show that the isormorphism of part (a) induces a G-isomorphism 


A/(AN E*")  —+ Hom(Gal(E(WA)/E), Ln). 
(d) Consider the “dual” Kummer isomorphism 
Gal(E(W/A)/E) —> Hom(A/(AN E*"), ttn) 


given by o +> g, where g(a) = o(X/a)/x/a for all a € A. Show that this 
is an isomorphism of Z[G]-modules. 

(e) Show that every finite extension E C L that is Galois over K and for which 
Gal(L/E) has an exponent dividing n is of the form L = E (A) for some 
finitely generated subgroup A C E* with the property that A/(A M E*”) is 
a Z[G]-submodule of E*/E*”. 

(f) Suppose that A C K*. Show that tot~! = 0? for all o in the Galois 
group Gal(E(x/A)/E) and all tr € G. Here w : G —> (Z/nZ)* denotes 
the character given by t(¢) = ¢° for all t € G. 

(g) Show that if the extension K C E (s/A) is abelian, we have t(a) = 
a®©™) (mod E*") for alla € A and allt € G. 


10 
The Double Wieferich Criterion 


When x, y € Z are nonzero solutions to Catalan’s equation x”? — y? = 1, Cassels’ 
theorem (Theorem 6.4) says that g divides x and p divides y. In this chapter, we prove 
a stronger version of this statement: We show that g* divides x and p? divides y. 
Using this result, one easily proves Theorem I of chapter 1. 

Let p be a prime and G = Gal(Q(¢,,)/Q)). The Stickelberger ideal of the group 
ring Z[G] was introduced in chapter 9. The Z[G]-ideal J was also introduced there. 
It is defined as the product of the Stickelberger ideal and the element | — v. Here 
t € G denotes complex conjugation, i.e., we have 1(¢,) = ieee We recall the defini- 
tion of the obstruction group H. We have 


H = {a € Q(é,)* : ord.(w) = 0 (mod q) for all primes t 4 p}/Q(¢,)*?. 


Here p denotes the prime ideal (1 — ¢,,) of the ring Z[¢,,]. See chapter 7 for the basic 
properties of the group H. 


Proposition 10.1 Let p, g be two distinct odd primes. Then the Z[G]-ideal I defined 
above annihilates the obstruction group H defined in chapter 7. 


Proof Recall the exact sequence 
OSS gl Be ae A oe “Chee 0 


of chapter 7. Theorem 9.6 says that the Stickelberger ideal annihilates the class 
group and hence the group Cl,[q]. By Lemma 7.1 (iii), the element 1 — 1 
annihilates E’, / E%,. It follows that the ideal J annihilates H, as required. 


Theorem 10.2 Suppose that p, q are odd primes and that x, y are nonzero integers 
satisfying Catalan’s equation x? — y4 = 1. Then q? divides x and p? divides y. 


Proof By Lemma 6.1, we have p 4 q. By symmetry, it suffices to show that q? 
divides x. Let 6 be an element of the ideal J introduced above. By Proposition 7.2, 
the number x — ¢, is contained in the obstruction group H. Since J annihilates H, we 
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have (x — ¢,)° = a4 for some a € Q(é,)*. Multiplying by the gth power (—¢;')? 
and applying 1, we find that (1 — ¢,x)’ = B4 for some B € Q(¢,,)*. By Cassels’ 
theorem, q divides x. This shows that both elements (1 — ¢,x)® and £ are integral 
at the primes of Z[¢,,] that lie over g. Therefore, it makes sense to speak of their 
images in the quotient ring Z[¢,,]/(q) or Z[¢,,]/ (q7). The congruences below are to 
be understood in this sense. Since x = 0 (mod q), we have 1 = 6% (mod q). Both 
sides are gth powers in Q(¢,). Therefore, Exercises 10.1 and 10.2 imply that the 
congruence holds modulo g?. It follows that 


(1 —¢,x)° = 1 (mod q’). 


We write 6 = >, -¢ Ngo and expand the product (1—¢,x)" = [],<g(l-o (f))x)”". 
Since x = 0 (mod gq), it follows that 


(oe ye Ngo (Cp)x = 1 (mod q?). 


ocG 


If q° does not divide x, we must have Dee NgO (fp) = 0 (mod gq). Since the roots 
of unity {o(¢,) : o € G} form a Z-basis for Z[¢,], this implies that all coefficients 
Ng Of 6 are divisible by q. 

Since @ was arbitrary, it follows that every element of the ideal J C Z[G] has 
all its coefficients divisible by g. But this is not true. Indeed, by Proposition 9.4, the 
elements e; € I have the property that all their coefficients are equal to +1. For 
instance, the element e, = (1 — t) f2 = (1 — 1)) € J is equal to 


I<a< 25! PE! <a<p— 1 


Therefore, g* must divide x and the theorem follows. 
Theorem I follows from this. 


Theorem I (P. Mihdilescu, 2000) Suppose that p, q are odd primes and that x, y 
are nonzero integers satisfying Catalan’s equation x” — y4 = 1. Then we have 


p!-'=1(modq’?) and g?~!= 1 (mod q’). 


Proof By Corollary 6.5 of Cassels’ theorem, we have x — 1 = p?~!a4 for a certain 
a € Z. Since x = 0 (mod q) and since p?~! = 1 (mod q) by Fermat’s little theorem, 
we have —1 = a? (mod q). Since both —1 and a4 are gth powers in Z, Exercise 10.1 
implies that 


—1 =a!‘ (mod q’). 
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Since q? divides x, it follows from Theorem 10.2 that 
—1=x—-1= pt a? = pt™|(-1) (mod q”). 


This proves the first congruence. The second follows by symmetry. 


We conclude this section by phrasing Theorem 10.2 as a statement about the 
element x — ¢, in the group H. Consider the following subgroup of H: 


S = {a € H: ais aq-adic gth power}. 


Here we call a € Q(¢,,)* “a qg-adic qth power” if it is a gth power in the completion 
F, of Qe) at each of the primes q lying over g. The group S is called a Selmer 
group. We have 


S =ker | H — ] | #3/Fi" 
ala 


Corollary 10.3 Suppose that p, g are odd primes and that x, y are nonzero integers 
satisfying Catalan’s equation x? —y? = |. Then the image of x—¢,, in His contained 
in the Selmer group S. 


Proof By Theorem 10.2, the element 1 — Gok is congruent to | modulo q?. Since 
q is odd, Exercise 10.4 implies that it is a gth power in the completions of Q(¢,) 
at all primes over q. Since —¢, is a gth power in Q(¢,,), we deduce that x — ¢,, is 
“a q-adic qth power’, as required. 


Exercises 


10.1 Let R be a commutative ring and let g be a prime number with the property 
that R/qR has no nonzero nilpotent elements. Show that if a,b € R satisfy 
a’ = b‘ (mod qR), then they also satisfy a7 = b? (mod q7R). 

10.2 Let n be a natural number and gq be an odd prime. Show that the ring Z[¢,]/(q) 
contains nonzero nilpotent elements if and only if g divides n. 


“The group H is equal to the flat cohomology group H4,,(X, dq). Here jug is the group scheme of 
the gth roots of unity and X denotes the scheme Spec(Z[¢,, ni The subgroup of a € H with the 
property that the extension Q(¢,) C Q(¢,, </a) is unramified at the primes q lying over g makes 
up the étale cohomology group H4,(X, }4,). Since the elements @ contained in the subgroup S have 
this property, the group S can be identified with a subgroup of H}.(X, ,). More precisely, it is the 
Selmer group 


ker HX, Mq) — I] H' (Spec( Fy ), Ug) 
ala 


68 Catalan’s Conjecture 


10.3 Let g be an odd prime and O be a finite unramified extension of the ring Z, 
of p-adic integers. Show that the multiplicative group (1 + ¢O)% is equal to 
1+¢70. 

10.4 Let p, g be odd primes and suppose that x, y are nonzero integers that satisfy 
Catalan’s equation x”? — y? = 1. Deduce from Corollary 6.5 that the following 
three statements are equivalent: 


(a) x — ¢, is contained in the subgroup S of H defined above; 
(b) q° divides x; 
(c) p?—! = 1 (mod gq’). 


11 
The Minus Argument 


In this chapter, we prove Theorem III of chapter 1. The proof is “Archimedean” 
in the sense that it exploits the fact that for any nonzero solution x,y of Catalan’s 
equation x? — y7 = 1, the absolute values of x and y are necessarily very large. This 
follows from Corollary 6.5 (iii). We use the notation introduced in chapter 7. 


Lemma 11.1 Let p,q be distinct odd primes and G = Gal(Q(¢,)/Q). Suppose that 
x,y are nonzero integers satisfying Catalan’s equation x? — y? = |. Then the map 


{0 € ZIG]: — f)" € QE} —> QE" 
that maps @ to the element a € Qé,,)* for which 
(x — gp)" =a! 
is an injective homomorphism. 


Proof Since Q(¢,) does not contain any primitive qth roots of unity, the element a 
is unique and the map is a well defined homomorphism. Suppose 6 = )°,2g Noo 
is in the kernel. This means that [Tee — o(¢,))"" = 1. The ideals (x — o(€,)) 
have at most the prime factor 1 — ¢, in common. In fact, by Corollary 6.5, we have 
x = l(modp). Therefore, the ideals (x — o(¢,)) actually do have this factor in 
common. It follows that the principal ideals generated by 


xX — o(Sp) 


i-¢, ; (o €G), 


are mutually coprime. We estimate their norms from the field Q(¢,) to Q. By 
Corollary 6.5 (iii), we have |x| > q?~' + q, so that |x — o(f,)| > q?7! for each 
o € G. It follows that for allo € G, we have 


as (p-1) 
v(: oe) > qd af 
1— op Pp 
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This shows that for every o € G, the ideal (x — o(€,)) is divisible by a prime ideal 
that does not divide any of the other factors in the product [[, eG — op)" =1. 
It follows that n, is zero for allo. 

We conclude that the homomorphism is injective, as required. 


Recall that by Proposition 7.2, the element x — ¢, modulo Q(¢,)*! is contained 
in the obstruction group 


H= {a € Q(¢,)* : ord, (a) = 0 (mod q) for all primes t 4 p} /Q(Ep)"4 


that was defined in chapter 7. The left-hand group in Lemma 11.1 is precisely the 
Z[G]-annihilator of the element x — ¢, in H. 


Definition For any element 6 = )°,-¢ 1.0 of the group ring Z[G], we define its 
size ||O|| by 


Ol] = >— Incl. 
ocG 
Proposition 11.2 Let p,q be distinct odd primes and let G be the Galois group of 
Q(é,) over Q. Suppose that x € Z satisfies |x| > 2 and that (x — ed = a! for 
some @ in the ideal (1—1) C Z[G] and some a € Q(é,)*. Then for every embedding 
p: Qé,,) <> C, the number $(a) is contained in the unit circle and there exists a 
qth root of unity —€ € C for which 


kal 


w@) Es TO 


Proof Let @ : Q(é,) — C be a fixed embedding. Since 0 = }°,-¢ noo is in the 
ideal (1 — 1) of Z[G], the absolute value of O((x — ey) is equal to 1. Therefore, the 
same is true for #(q@), so that it lies on the unit circle. 

Using the Taylor series expansion of the principal branch of the logarithm, 
Exercise 11.1 implies 


sere) \< Il 
Xx 


Ix] — 1’ 


|Arg($(a)*)| = llog((a)*)| < }> |nlllog (1 


oc€G 
and hence 


20k |6|| 
|Arg(p(a)) [-s for some k € Z. 
q q(x| — 1) 


The left-hand side of this inequality is precisely the length of the arc between the 


2mik 
complex numbers ¢(a@) and = e « . Since the arc is at least as long as the segment 
between these two numbers, the proposition easily follows. 
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In chapter 10, we defined the ideal J C Z[G] as the product of the Stickel- 
berger ideal by | — vu. If x, y € Z is a nonzero solution of Catalan’s equation, 
Proposition 10.1 implies that every 6 € J satisfies the condition (x — tay = al 
of Proposition 11.2. This explains the importance of the ideal J for the proof of 
Theorem III. Under the assumption g > 4p7, Mihdilescu shows that the ideal 
I Cc Z[G] contains many elements 6 of small size. This is the content of the next 
proposition. We will then use the box principle to derive a contradiction from this 
and deduce Theorem III. 


Proposition 11.3 Let p,q > 5 be prime numbers satisfying q > 4p. Then the 


ideal I contains at least q + | elements of size at most ret 


Proof Lets = [3 pal: By Proposition 9.3 (ii), the elements e; = (1—1) f; fori = 
1325 Ps pet form a Z-basis for the ideal J introduced above. By Proposition 9.4, 
half of the coefficients of e; are equal to 1, while the other ones are all equal 


to —1. Therefore, the size |le;|| of e; is equal to p — 1 for each 1 <i < (p — 1)/2. 
It follows that any element of the form 


(p—1)/2 
6= > Kiei, 


i=1 


1/2 


with A; € Zso satisfying paw 1 Ai <5, has the property that 


zs 


q 
< —1 y i; —— 


p—1 
By Exercise 11.3, there are (Fz) such elements. There are exactly as many 
elements ye Aie; with all coefficients 4; < 0 that satisfy yee [Ail <8. 
Therefore, there are at least 
p-1 
s+ 22 
2( ae )-1 
s 


elements in J of size at most st 


Now we apply Lemma 11.4 below with k = fot . Since s = —4, Ee “¥ x], this leads 
to the inequality 


pol 2 
| tie )-1 = 36+ De-D +1 Su geh A. 


This shows that there are at least g + 1 elements 6, as required. 
It remains to check that the conditions of Lemma 11.4 are satisfied: Since 
q > 4p’, the number s = boty! is at least 6. Since p > 5, we have k > 2. 
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Moreover, (k, 5) is not equal to any of the exceptional pairs that are listed in 

Lemma 11.4. Indeed, if k = 2, we have p = 5 and 100 = 4p? < q < 3(s +1) 

(p—1)* = 32(s+1)/3. This implies, s > 300/32 > 9. If k = 3, we have p = 7 and 

196 = 4p? <q < 2(s + 1)(p — 1)° = 24(s + 1), which implies s > 196/24 > 8. 
This completes the proof of the proposition. 


Lemma 11.4 For all pairs of integers (k, s) with k > 2 and s > 6 with the excep- 
tion of (k, s) = (2, 6), (2, 7), (2, 8), and (3, 6), we have 


k 4 
(2) > S64 DR +1, 


S 


Proof See Exercise 11.4. 


Corollary 11.5 Let p,q be odd primes satisfying q > 4p? and suppose that x, y € 
Z is anonzero solution of Catalan’s equation x? — y1 = 1. Then for any embedding 
g : QSp) @ C, there exists a nonzero element 6 € I with ||@|| < 1 with the 


p-l 
property that (x — co). = a! for some a € Qé,,) with 


216 Il 


—1 —_—.. 
"Oat = sa a5 


Proof By Proposition 7.2, the number x — ¢, modulo Q(¢,)*4 is contained in the 
obstruction group H. By Proposition 10.1, the ideal J annihilates H. Proposition 11.3 
ensures the existence of at least g + 1 elements 0 in J of size at most oe For each 


of these elements 6 € J, we have (x — bp) = a! for some a € Q(¢,) depending 
on 9; hence, by Proposition 11.2, 


Il 
= < es 
WOE) SS Fae 
for some qth root of unity € € C that depends on @. By the box principle, there exist 
then two distinct such elements 6, 92 satisfying this inequality for a, a2 with the 
same qth root of unity € = & = &. We set 0 = 0; — 6) anda = a /a2. Then we 
have (x — Cae = a7. Moreover, we have 6 # 0 and, by Exercise 11.2, we have 
[All < [All + ll < 24. We also have 


Ip(a)— 1] = |b(e1) — b(@2)| 
S  |P@1) — €| + |6(@2) — € 
21141 
q(lx| — 1)’ 


as required. 
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Theorem III (P. Mihdilescu, 2003) For any pair of odd primes p,q with q > 4p* or 
p > 4q’, there are no nonzero x, y € Z for which Catalan’s equation x? — y1 = 1 
holds. 


Proof By Theorem IV, we may assume p,q > 5. The proof is by contradiction. 
Suppose that x,y are nonzero integers that satisfy Catalan’s equation x? — y4 = 1. 
By symmetry, we may assume that g > 4p”. We fix an embedding ¢ : Q(ep) @ C. 
By Corollary 11.5, there exists a nonzero element 0 € J with ||@|| < ot with the 
properties 


(x —¢,)° =a? for some w € Q(¢,,) 
and 
lp@)—-1| = 
~ q(x] — 1) 


The same inequality holds for the complex conjugate of ¢(q@). For the remaining 
p — 3 embeddings ¢, we have |@(#) — 1| < 2. This follows from the fact that 
(a) = 1 for every embedding ¢. Taking the product over the embeddings @ : 
Q(é,) <— C, we obtain the following inequality for the norm from Q(¢,) to Q 


ofa —1: 
Wactiyt eC IY 
- = @ \ist—1) * 


In order to obtain a lower bound for N(a — 1), we first note that a — 1 is not zero. 
This follows from Lemma 11.1 and the fact that 6 4 0. We estimate the norm of its 
denominator J. The ideal J of Z[¢,] is also the denominator of a. So, the principal 
fractional ideal (@) is equal to J’/J for some Z[¢, ]-ideal J’ that is coprime to J. It is 
convenient to take gth powers and to consider the fractional ideal (~)4 = (x — Cire 
We have 


Gg.) = [[a-6@yr S795 


ocG 


Since 0 = (1 —1)w for some y, the norm of (x — fy is equal to 1. It follows that 
N(J') = N(J). We also have 


[[@-o@ ym! = s/ts9. 


ocG 


This implies that N(J)?4 = N(J'J)@ divides NULege= £y)!"e!), which is at most 
(|x| + LI@-DIFl, Therefore, 


aa 
NJ) < (xl lw. 
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On the other hand, since J(a~ — 1) is an integral nonzero ideal, we have 
N(J(a@ — 1)) => 1. Combining everything, we obtain the inequality 


— thy 2P-1 / 1a, \? 
(xj +72" < NW! < N@-l < ~~( ve 


o ge Nad 


Since |x| > q?~! > 20, we have |x| + 1 < ¥ (|x| — 1) and hence 


_ pol 16 ||6\I- , 
(x| + b> Wl = ey ae (*) 
The size of 6 satisfies ||0|| < ae so that 
16 
(xl +)? < —,2P1), 
(p= 1 


By Corollary 6.5, we have |x| > q?~!. Since p > 5, this leads to 
Gere. se: a EOIES ee RPO 


contradicting q > 5. 
This proves the theorem. 


In the proof of Theorem III, it is essential that the exponent 2 — 7m ||| to which 
|x| + 1 is raised in the left-hand side of the inequality (*) is positive. Here the 
positive contribution “2” in the exponent is the number of embeddings Q(¢,) > C 
for which the absolute value of the image of a — | is very small. To guarantee 
this, we made the assumption ||@|| < ot, but the choice of the constant 3 is rather 
arbitary. With some more work, one can prove a similar theorem for a somewhat 
larger constant. 


Exercises 
11.1 Let “log” denotes the principal branch of the complex logarithm function. 


(a) Show that |log(1 ae 
(b) Show that |log(xy)| < llog(x)| + |log(y)| for all somnplek numbers x,y that 
are not contained in Ro. 


11.2 Let p be a prime and let G = Gal(Q(¢,,)/Q). 


(a) For 8 € Z[G], show that ||@|| = 0 if and only if 90 = 0. 
(b) Show that |/6 + 6’|| < ||@|| + ||0’|| for all 6, 6’ € Z[G]. 


11.3 Let k and s be natural numbers. Show that the number of row vectors (A;,..., 
Ax) with A; € Z satisfying 0 < A; < s and am Aj < s is equal to the 
binomial coefficient ees 


11 The Minus Argument 715 


11.4 The goal of this exercise is to prove the inequality of Lemma 11.4. 
(a) Show that the inequality 


k 4 
ie) > 564+DE +1 


S 


holds for the pairs (k, s) = (4, 6), (3, 7), and (2, 9). 

(b) Show inductively that if the inequality holds for a pair (s, k), it also holds 
for all pairs s’, k’ for which s’ > s and k’ > k. 

(c) Prove Lemma 11.4. 


12 
The Plus Argument I 


In this chapter and chapter 14, we prove Theorem II of chapter 1. There are two 
main ingredients: the Runge method, exploited here, and Thaine’s theorem, which 
is used in chapter 14. While the proofs of the results of chapters 8, 10, and 11 involve 
objects like the minus class group and the Stickelberger ideal, both anti-invariant 
under complex conjugation, the present proof involves invariant objects like the 
group of cyclotomic units and the class group C ag of the field Q( Es ). 

Let p, g be distinct odd primes. We use the notation of Chapter 7. In particular, 
we have G = Gal(Q(¢,)/Q) and Gt = Gal(Q(¢F)/Q). For any s € Q, we let 
(1 + T)* denote the power series in Q[[T]] defined by 


copy = po ik 


k>0 


We formulate two propositions concerning power series. For a natural number q and 


an element 6 = )° 4 oo in the group ring Z[G], let 


F(T) = (1—¢,7)** = [[Q-o(,)TY"* € Q¢>)ITI, 


ocG 


where, as explained above, each factor in the product is defined by 


(l—o(¢,)TY*"4 = >)~ (ie "opr € Q¢))ITI. 


k 
k>0 


For each embedding @ : Q(¢,) <> C, we let F %(T) denote the power series in 
C[[T]] obtained by applying ¢ to the coefficients of F(T). For any t € C, substi- 
tuting J = in the power series F%(T'), we obtain a series of complex numbers. If 
this series converges, we denote its sum by F?(f). 


Proposition 12.1 Let é = )- 
the following. 


seg loo € Z[G]. Using the notation above, we have 


(i) The coefficients of F(T) are integral outside q. 
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(ii) The power series F(T) has the form 


= Ck rk 
CD ee Beara : 


k>0 
with cx, € Z[¢p] satisfying 


ce =(— Y o(fp)ng)* (mod q). 


oEG 


(iii) For any embedding @ : Q(ép) <> C and any t € C with |t| < 1, substituting 
T = t into F(T) leads to a convergent series. For any k > 0, we have 


[Fe — Feo) <|( Li 
Ne NED] = [epee 


Here Fo(T) is the polynomial that is the sum of the first k terms of F?(T) 
and Fi (t) is its value in the complex number t. By m we denote the rational 
1 


1 
number malta =; eee tel. 


Proof Part (i) follows at once from Exercise 5.7. To prove (ii), we consider the 
power series 


(1 —o(p)q@T)* 
Ng (Ng = q)- : ‘(Ng = (k = 1)q) 
k! 


G(T) = FQ@T) 


(—o(¢))T)*. 


k>0 


The coefficients of this power series are visibly of the form a;/k!, where a, is 
an element of Z[¢,] that is congruent to (—o(fp)no)* (mod q). An application of 
Exercise 12.2 to the domain Z[¢,] shows that G(T) is equal to >)... T* with 


k 
= (- rs os (mod q). 


oEG 


Since we have G(T) = F (qT), the result follows. 
To prove (iii), we observe that by Exercise 12.1, we have for every o ¢€ G, that 


(a) Z eras) = cor(“ 9), 
k k k 


It follows that the absolute values of the coefficients of the series 


‘ ("our 


k>0 
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are less than or equal to the corresponding coefficients of the series 


Dent = (_— pas 


k>0 


all of whose coefficients are positive. Therefore, the absolute values of the coeffi- 
cients of F?(T) are smaller than those of the power series 


_ — Hel —~(— FD, Itoh =~(|—_Ty"= TINY mk 
[[a-” aT 2) ls rm Ey 


o€G k>0 
It follows that 
[Fet)— FPO) < | — |e)" — ede) 


for t € C with |t| < 1. Here s,(7) denotes the sum of the terms of degree at most k 
of the Taylor series of the function (1 — T)~”. By Lemma 5.1 and Exercise 5.8, we 
have 


Kd = ep = sete < | lik 
- —s —_—_—______., 
k = k+ | el = |t|)@teel 


which implies the inequality of (iii). 
This proves the proposition. 


Proposition 12.2 We use the notation of the previous proposition. If8 = Y°,-¢ No 
is contained in the ideal (1 + t) of Z[G], then we have the following. 


(i) The power series F(T) is contained in the subring Qe UTI of the power 
series ring Q(¢y)[[ T]]. 

(ii) Suppose that t € Q satisfies |\t| < 1 and that (1 — té,)° = B4 for some 
element B € Qo). Then we have that F*(t) = $(B) for all embeddings 
$:QGt) OR 


Proof The fact that @ is contained in the ideal (1 + 1) means precisely that for 
allo ¢€ G, the coefficients n, and n,, of @ are equal. Since, for every o € G, the 
coefficients of the polynomial (1 — o(¢,)T)(U. — ou(f,)T) are contained in QF), 
part (i) follows. 

We prove (ii). Let @ be an embedding Qe,)t <> R. We extend it to an embed- 
ding Q(¢,) — C. Since ¢ is a homomorphism, we have (6)? = o((1 — fias!) = 
TI. «GU — tb(o(fp)))"". This number is contained in R. By ordinary analysis, its 
real qth root is equal to F%(t) . In other words, we have o(B)? = F9(t)1. Since q 
is odd, real numbers have unique gth roots. Therefore, we have ¢(8) = F9(t), as 
required. 
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Part (ii) of Proposition 12.2 addresses a delicate matter. In general, substitution 
in power series does not commute with Galois action. But here it does. 

In the rest of this chapter, we slightly abuse notation as follows. Consider the 
natural surjective homomorphism F,[G] —> F,[G*]. For any w € F,[G*], the 
product of (1+) by a lift of w to F,[G] does not depend on the lift of y. Therefore, 
we write (1 + )w for this element of F,[G]. 

Before proving the main result of this chapter, we prove a lemma. Recall the 
definition of the obstruction group H. We have 


H= {a € Q(¢,)* : ord.(w) = 0 (mod gq) for all primes t 4 p} (OC? 


Lemma 12.3 Let p, q be distinct odd primes and suppose that x € Z satisfies 
x = 1 (mod p). Suppose that y € F,[G*] annihilates the element (x—,)'*' of H™. 
Then there exists a lift@ = >~ ngo € Z[G] of one of the elements + + 1)w 
with the following properties: 


ocG 


© Ng =Ng, foralla € G; 

e nz => Oforeacho € G; 

© >, No = mg for some integer m satisfying 0 < m < (p — 1)/2; 
e (x- eye = a! for some algebraic integer a € OG 


Proof Let 6’ = er noo € Z[G] be the unique lift of (1 + uw € F,[G] with 
the property that 0 <n, < q for every o € G. Notice that automatically ng = no, 
for allo € G, so that 6’ is contained in the ideal (1 + ¢) of the group ring Z[G]. 
The element 0” = 0 -g(q —Nno)o € Z[G] lifts —(1 + ow and has its coefficients 
between | and g. It is also contained in the ideal (1 + 1). Both elements 6’ and 6” 
have nonnegative coefficients. Since we have 


6’ +6" = qa, 


ocG 


the sum of the coefficients of 6’+ 0” = q )°,¢¢ 0 is equal to g(p — 1). Therefore 
the sum of the coefficients of one of 6’, 0” is equal to mq for some rational number 
m Satisfying 0 <m < po. That’s the element 6 we take. By construction, we have 


(x —¢,)° = a4 for some a € Q(g,)*. 


Since the coefficients n, are nonnegative, the number (x — Ee is an algebraic 
integer. Since we have n, = n,, for allo € G, the element (x — ¢ oy is invariant 
under complex multiplication and is hence contained in Qe; ). The same is then 
true for a. 

Finally, to see that m is an integer, we note that x = | (mod p), so that = 1—¢, 
divides every conjugate of x — ¢, exactly once. It follows that 
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Yong = ordy (x — fp)’) = ord, (a) = 0 (mod q). 


o€G 


This proves the lemma. 


To put the next theorem in a context, recall that Theorem 8.3 says that if 
x? —y4 = 1 is a nontrivial solution of Catalan’s equation, then the minus component 
(x — ¢,)'~ of the element x — ¢, of the obstruction group H is nontrivial. We now 
prove a much stronger statement for the plus component (x — ¢ Py ae We show that 
it generates a free F,[G*]-module. 


Theorem 12.4 Let p,q > 7 be distinct primes and suppose that x,y € Zisa 
nonzero solution to Catalan’s equation. Then the F,[G]-submodule of H that is 
generated by the image of (x — ty is a free module over the ring F,[G*]. In 
other words, the F,{G*]-annihilator of (x — €,)'*' is trivial. 


Proof Suppose that wy € F,[G*] annihilates the element (x — ¢,)!*' of H. By 
Corollary 6.5, we have x = 1 (mod p), and Lemma 12.3 applies. Let 6 = 0, noo 
in Z[G] denote the lift of (1 +1), whose existence is guaranteed by Lemma 12.3. 
We have therefore 


(x — &)’ = | [@- 0)" = a! 


ocG 


for some nonzero algebraic integer aw € Q(¢é oo: Since q is odd and Q(¢é a ) does not 
contain any nontrivial gth roots of unity, the element @ is unique. We deal with the 
Diophantine equation 


] [* -o@) = v4 


oEG 


using Runge’s method. By Lemma 12.3, we have that )>,-¢ No = mq for some 
integer m > 0. It follows that for each embedding ¢ : Q(¢,) — C, we have 


No /q 
te =<" FL (1 22822)" a (2), 


Xx 
ocG 


Here F(T) is the power series that occurs in Proposition 12.1 and F?(T)eR[[T]] 
is the power series one obtains by applying ¢ to its coefficients. Let F,,,(7) denote 
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the polynomial that is the sum of the terms of F(T) of degree < m. Consider the 
element 


1 
— «-x"F, (=) E Qc7). 
x 


For every embedding ¢ : Q(¢,) <> C, we have 


1 1 1 
Io)| = |o(@) — x" FS (<) a ee (<) —x"Fe (=) 
Xx Xx Xx 
1 ( =i )I( 1 _ qn ( 1 i 
—— Bo ‘ 
m+1 |x| |x| |x| 


xl 
The inequality follows from Proposition 12.1 (iii) and the fact that we have | ( oat ) | = 


m+1 
() < 4”. See Exercise 2.4. 


Since the coefficients of the polynomial F,,,(t) have denominators, the number 
Z=a—x™ F,(4) is not integral. By Proposition 12.1 (ii) and (iii), the denominators 
of the coefficients of F,,(t) divide D = g”"*°s("), Therefore, 


m= o(e-rr(2) 


is integral. By the estimate above, for every embedding ¢ : Q(¢ a ) > R, we have 


IA 


qm 1 —2m—-1 
|f(Dz)| es gr (1 _ ~) ; 


|| 


Therefore, 


1 
log |@(Dz)| < —log jl+(m + ) log g+m log 4—(2m+1) log (1 - x) . 
Ge 


|x 


We confront this estimate with the fact that |x| is very large. By Corollary 6.5, we 
have |x| > q?~! + q. This implies that 


ieee = ie = eee), 
1 1 
oa (1 | :) < < 1g?! < 1/q?. 
x 


|x| —1 


See Exercise 11.1. Since we have 0 < m < (p — 1)/2, this leads to 


log |@(Dz)| a Pe 1 ( ms 1 re log *) Pp 
log(q) = 2 q-1 logq/ q?logq 
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Since g > 7, this is at most 


p-1l 1 log4 D 
1 , 
2 ( "ager * log 7 


which is negative for all p > 3. 

This proves that |6(Dz)| < 1 for every embedding ¢ : QF) <— R. It follows 
that the absolute value of the norm of Dz is less than 1. Since Dz is an algebraic 
integer, this implies that Dz is zero. Therefore, we have 


1 
Da = Dx"F,, (=) 
xX 
m 
= De ia ke = nk. 
qrk! 
k=0 


Since @ is an algebraic integer, Da is an algebraic integer that is divisible by g. By 
Proposition 12.1 (ii), all terms on the right-hand side are g-adic integers. Moreover, 
all are divisible by g, except possibly for the mth one. Then the mth term must, of 
course, also be divisible by g. This implies that the coefficient c,, is divisible by gq. 
By Proposition. 12.1 (ii), we have then that 


(- root) = (0 (mod q). 


oEG 


By Exercise 10.2, the ring Z[¢,]/(q) does not contain any nonzero nilpotent elements 
and it follows that 


~ Noo (Cp) = 0 (mod q). 


o€G 


Since the set {o(¢,) : o € G} forms a Z-basis for Z[¢,], we deduce that the coeffi- 
cients n, satisfy ng = O(mod q) for all o € G. It follows that 6 = 0 (mod q) in the 
ring Z[G], so that y = 0 (mod q) is zero in Z[G*]. 

This proves the theorem. 

We remark that the binomial coefficient (e :) that appears in the preceding proof 
is equal to m times what is called the mth Catalan number. See [45, pp. 219-229]. 

In terms of the explanation of Runge’s method given in chapter 7, the relevant 
algebraic curve here is given by the equation 


[]&- eG" = vt. 


ocG 


84 Catalan’s Conjecture 


Since ng = Ng, for allo € G, this curve is defined over Q(¢ - ). From the fact that 
m= : >, Nois an integer, we deduce that the equation looks like 


X"4 + (lower-degree terms) = V%. 


Letting W = V/X", we see that the divisor at infinity has equation W? = 1. This 
is the pole divisor of the function X. So, up to Galois conjugacy, the function X has 
precisely two poles: a pole Q’ given by W= 1 anda pole Q given by W = ¢,, where 
€, denotes a primitive gth root of unity. The pole Q’ is defined over Q and the pole 
Q is defined over Q(¢,). The estimates given in the proof of Theorem 12.4 show 
that the function 


1 
Z' = V = xX" F,, (=) 
Xx 


remains bounded as the point P = (x, v) gets close to the pole Q’, i.e., when 
|x| — oo. Therefore, Z’ has no poles outside Q. The function Z = DZ’ with D = 
qm*erda(") has the same property and is, in addition, integral over the ring Z[X]. 
Since the field Q(¢ HA ) is totally real and the pole g is complex, the absolute values 
of #(Z'(P)) can be explicitly bounded for all embeddings @ : Qc ) @ R and, 
therefore, the integral points on the curve can be determined. 

This is what was done in the course of the proof of Theorem 12.4. As in the proof 
of Cassels’ theorem, the main point is to do the estimates uniformly in the exponents 
p and q. 


Exercises 
12.1 Let x € Rand letk € Z.o. Show the following. 


(a) We have |(;)| = |(}") 
(b) If x < 0, then the binomial coefficient (;) has sign (— 1). 


12.2 Let R be a domain of characteristic 0 with quotient field K and let J C R be 
an ideal. Suppose that the two sequences {a,} and {b;,} in R have the property 
that a, = a‘ (mod 1) and b, = b* (mod J) for some a, b € R and all k > 0. 
Show that the product of the two power series )°,.9 #7* and Y°,.o be Tk in 
K[[T]] is equal to >...) 47" with cz € R satisfying c, = (a +b)‘ (mod J). 

12.3 Let p be an odd prime number. Show that the ring of integers of Q(¢ Ms ) is equal 
to ZigF I. (Hint. see Exercise: 7.2.) 


13 
Semisimple Group Rings 


In this chapter we consider group rings of a finite abelian group with various coef- 
ficient rings. The material presented here is well known and can be found in several 
algebra books. 

Let I" be a finite abelian group and k be a field of characteristic not dividing #I”. 
Let k denote a separable closure of k. Every character xy :  —> k. takes its values 
in the subfield of k that is generated by the roots of unity of order #1”. The Galois 
group Gal(k/k) acts on the characters x : [1 —> kj. We let x°(g) = o(x(g)) 
for g € I ando € Gal(k/k). For any character y : G —> ky, we let k, denote 
the subfield of k; generated by the image of x. We have k, = k(¢q), where fq is a 
primitive dth root of unity and d is the order of x. Each field k, is a k[I”]-algebra 
via g-x = x(g)x for g € I and x € ky. In this way, for every character x, any 
k,-vector space is in a natural way a k[J”]-module. 

The group ring k[J"] is semisimple. More precisely, there is an isomorphism of 
k[I]-algebras 


KP] —> [] kx. 
x 


It is the k-linear extension of the homomorphism that maps g ¢€ I to the 
vector (x(g)),. Here x runs over the Gal(k / k)-conjugacy classes of characters 
xl — kp. 

For any k[J"]-module M and character x, we let M, = M @yrj k, denote its 
x-part. Taking x-parts is an exact functor from the category of k[G]-modules to the 
category of k,-vector spaces. We have a natural isomorphism of k[J”]-modules 


mM = [|™,. 
x 


Here x runs over the Gal(k/k)-conjugacy classes of characters x : [ —> kh. 
The following proposition is also valid for non-abelian groups I’. 


Proposition 13.1 Let I” be a finite abelian group and let k be a field of character- 
istic not dividing #I°. Then every exact sequence of k[I"|-modules splits. 
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Proof Any short sequence of k[J”]-modules 


0 ay eee: EG > 0 


splits as a sequence of k-vector spaces. We can therefore find a k-linear map 
h’ : C —> B with the property that h - h’ is the identity map on C. Now we take 
the “average” over the [’-conjugates of h’. In other words, consider h” : C —> B 
given by 


1 
W'(0) = ge Do ho). 


ocl 
Then h” is k[I"]-linear and h-h" is the identity map on C. The proves the proposition. 


Let k = F,, where g, is a prime number not dividing #J°. For each character 
x : G —> k7,, let O, denote the ring extension of the ring of g-adic integers Z, 
that is generated by the dth roots of unity. Here d is the order of x. The rings O, 
are discrete valuation rings. In particular, they are principal ideal domains. Since qg 
does not divide #/”, the maximal ideal of O, is generated by q. The residue field 
of O, is equal to k,. Each ring O, is a Z,[I”]-algebra via g - x = x(g)x forg € G 
and x € O,. In this way, for every character x, any O,-module is in a natural way 
a Z, (I ]-module. 


By Exercise 13.2, the map Hom(J’, O;) —> Hom(l, ky) induced by the natural 
homomorphism OY —» ky, is a bijection. The Galois group Gal(Q, /Q,,) acts on 
the characters x: We have x°(g) = o(x(g)) for g € Pando € Gal(Q,/Q,). This 
action is unramified and is compatible with the action of Gal(F, /F,) on Hom’, ky) 
that was defined above. 

The Z,-linear extension of the homomorphism that maps g € I” to the vector 
(xX(g))x is an isomorphism of Z,[J"]-algebras 


ZAP) => I] o- 
x 


Here the product runs over the Gal(Q, /Q,)-conjugacy classes of characters x of I”. 
Indeed, since this map is a bijection modulo q, it is surjective by Nakayama’s lemma. 
It is injective because both sides are free modules over Z, of the same rank. For any 
Z,["|-module M, we let M, = M @7,,r\ O, denote its x-part. We have a natural 
isomorphism of Z, [J”]-modules 


mM = [|™,. 
x 


Here the product runs over the Gal(Q, /Q,,)-conjugacy classes of characters x of I’. 
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Proposition 13.2 Let I" be a finite abelian group and q be a prime not dividing #I. 
Let M be a finite Z[I"|-module. Then the F,|I°|-modules M{q] and M/qM are 
isomorphic. 


Proof We may replace M by M ®z Z, and hence assume that it is a Z,[J”]-module. 
We consider one x-part M, at the time. Each M, is a finite module over the 
principal ideal domain O,. It follows then from the structure theorem of modules 
over principal ideal domains that M,[q] and M,/qM, are k,-vector spaces of 
the same dimension. Since we have F,[J”]-isomorphisms M[qg] = [| P M,[q] and 
M/qM = Ly M,/qM,, we see that M[q] and M/qM are isomorphic F,[I"]- 
modules, as required. 


Lemma 13.3 Let I" be a finite abelian group and let q be a prime not dividing #1. 
Let M,N be two torsion-free, finitely generated Z,|I"|-modules. Then M and N are 
isomorphic if and only if M ®z, Q, and N ®z, Q, are isomorphic Q, [I |-modules. 


Proof Tensoring the isomorphism Z,[I"] = |] , Ox with Q, we obtain an isomor- 
phism of Q, [I”]-algebras: 


Q,1r] — [| kx. 
x 


Here the product runs over the Gal(Q, /Q,,)-conjugacy classes of characters x of I” 
and K, = O, ®z Qs the quotient field of O,. Suppose M @z, Q, and N @z, Q, 
are isomorphic as Q,[J”]-modules. For each character x, taking x-parts gives us 
two free finite-rank O,-modules M, and N, with the property that M, @o, K, and 
Ny, @o, Ky are isomorphic K,-vector spaces. This simply means that they have 
the same dimension d. Since M and N and hence M, and N, are torsion-free, it 
follows that the O,-rank of both M, and N, is also d. This implies that M, and N, 
are isomorphic O,-modules. Since we have the Z,[I"]-isomorphisms M = || Mx 
and N = ]] , Ny, we see that M and N are isomorphic Z,[/"]-modules, as required. 


Lemma 13.4 Let I be a finite abelian group and let M and N be two finite- 
dimensional Q[I"]-modules. Then M and N are isomorphic if and only if M @®g R 
and N &g@ R are isomorphic R[I"]-modules. 


Proof Applying the theory of the beginning of this section to the field k = Q, we 
obtain a natural isomorphism of Q["]-algebras Q[J"] = Ie F,, where F, denotes 
the number field generated by the image of x. 

Tensoring with R, we obtain an isomorphism of R[J”]-algebras: 


RiP) = | F, @gR. 
x 


Here the products run over the Gal(Q/Q)-conjugacy classes of characters x of I’. 
For every character x, the R-algebra F, @q R is a product of copies of R and C. 
If M @g R and N @gq R are isomorphic as R[J”]-modules, then for every x, the 
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F,, ®g R-modules (M @q R) @r (Fy ®@q R) and (N Sg R) @r (F;, @q R) are 
isomorphic. 

For every character x : 1 —> Ey, let M, denote the F,-module M @q F,,. For 
every Q[J”]-module P, we have natural isomorphisms of F’, ®g R-modules 


P, @QR = (P @QR) @r (F; SQ R). 


We conclude that the R[J”]-modules M, ®@g R and N, @g R are isomorphic. It 
follows that these two modules have the same R-dimension. It follows that the 
Q-dimensions and hence the F’,-dimensions of N, and M, are equal. Therefore, M, 
and N, are isomorphic F,,-vector spaces. This implies that M and N are isomorphic 
Q[l’]-modules, as required. 


Corollary 13.5 Let I be a finite abelian group and let q be a prime not 
dividing #1’. Suppose that M and N are Z-free Z[G]-modules and that M ®z R and 
N @z R are isomorphic R|I"|-modules. Then M/qM and N/qN are isomorphic 
F,[I"]-modules. 


Proof Suppose that we have an isomorphism M@zR = N @zR of R[J]-modules. 
Lemma 13.4 implies then that there is an isomorphism M ®@z Q = N @z Q of 
Ql’ ]-modules. Tensoring with Q, over Q, we obtain an isomorphism M @7 Q, = 
N @z Q, of Q,[1"]-modules. Since M @z Z, and N @z Z, are torsion-free, 
Lemma 13.3 gives us an isomorphism M @z Z, = N @z Z, of Z,[J"]-modules. 
The result now follows by reducing this isomorphism modulo q. 


We conclude this section with an application to the p-unit group of the cyclotomic 
field Q(¢,). First we prove a lemma. 


Lemma 13.6 Let p be an odd prime and let E., be the group of p-units in Q(¢,). Let 
EX = on Oc) and Gt = Gal(Q(Z>)/Q). Then there is an an exact sequence 
of G* -modules 


h 
0 => jee eed = Ey/L2p =? Mop/Mp — 0, 


Where h is given by h(u) = u'! = W(u)/u. 


Proof For every u € E>, and for every embedding ¢ : Q(¢,,) > C, the absolute 


values of the images of u‘—! under ¢ are equal to 1. Therefore, Exercise 7.1 implies 
that h(u) is a root of unity. The group of roots of unity of Q(¢,) is 2), and for 
U € [4ap, the number u'~! = u~? is contained in jz». Therefore, the map / is a well- 
defined group homomorphism. It follows that the G-module E,,/{12) is -invariant 
and hence has the structure of a G*-module. The surjectivity of h follows from the 
fact that h(¢, — 1) = (¢, — 1)" = —¢5". If we have h(u) = u! € pp, then 
(ug)! = 1 for some ¢ € 2p. In other words, u = Cu’ for some uw’ € Ee. Finally, 
we have (12) M Ee = {+1}. These facts show that the sequence is exact. This proves 
the lemma. 
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Proposition 13.7 Let p be an odd prime and G* = Gal(Q(¢)/Q). Suppose that q 
is a prime not dividing p(p — 1). Then the p-unit group E, of Q(¢,) has the property 
that E.,/E% is free of rank 1 over F,(G*]. 


Proof Let ZS = E, 1 Q(é,). Since p is totally ramified, there lies only one 
prime over p and it follows from Dirichlet’s unit theorem that the logarithmic 
map 


A: Ef oO []R 
¢ 


given by € +> (log|@(e)|)g maps EF /{+1} isomorphically to a lattice of rank #G* 
inside [|] és R. Here the products run over the set of embeddings ¢ : Qe YOR. 
The logarithmic map is easily seen to be Z[G* ]-linear. It follows that the R[G* ]- 
module (EF /{+1}) @ R is isomorphic to |], R. 
By Exercise 13.3, the R[G*]-module 


Qé7) eR = []R 
¢ 


is free of rank 1. Therefore, the Gt-modules EX /{+1} and Z[G*] become isomor- 
phic when we tensor with them R. By Lemma 13.6, the Gt -modules ES /{+1} and 
E},/[42p also become isomorphic when we tensor with R. Since all modules are 
Z-free and we have [2) C ES, the result follows from Corollary 13.5. 


Exercises 


13.1 Let I be a finite group and let k be a field whose characteristic does not 
divide #1”. For any character of I”, let k, denote the extension of k that is 
generated by the values of x. Show that 


Yolk, :k] = ar. 


4 


Here the sum runs over the Gal(k/k)-conjugacy classes of the characters 
of I. 

13.2 Let I” be a finite group and g be a prime not dividing the order of I”. For 
every character x of I”, let O, denote the Z,-algebra that is generated by the 
image of I". Let k, denote the residue field of the local ring O,. Show that the 
reduction homomorphism 


Hom(l, 07) — Hom(!, ky) 


is an isomorphism of groups. 
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13.3 


13.5 
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Let K be a finite Galois extension of Q with H = Gal(K /Q). Suppose that K 
is a totally real number field. We let the group H act on 


Hs 


via o(Xg)6 = (Xg.o-1)g- Here the product runs over the various embeddings 
go: K ~ R.In this way, |] » R acquires the structure of an R[H ]-module. 


(a) Show that ITs R is a free R[ H ]-module of rank 1. 
(b) Show that the natural map K —> [|] 5 R given by x +> (@(x))g induces 
an isomorphism K @ R — Ire R of R[H]-modules. 


Let p be a prime and let I” be a group of order p. 


(a) Give an example of a nonsplit exact sequence of F,,[”]-modules. 
(b) Give an example of a finite Z,[J”]-module M with the property that the 
F,[1”]-modules M[p] and M/pM are not isomorphic. 


Show, by means of a computer calculation, that for p = 1129 and q = 3, the 
group E,/E3 is not a free F,[G*]-module. See [42]. 


14 
The Plus Argument II 


In this chapter, we prove Theorem II of chapter 1. The key ingredients are the 
results of chapter 12 and Francisco Thaine’s famous theorem [48], which is proved 
in chapter 16. Let p, g be odd primes and let x, y € Z be a nonzero solution to 
Catalan’s equation x? — y7 = 1. By Lemma 6.1, we have p 4 q. We use the 
notation introduced in Chapter 7. In particular, we let G = Gal(Q(¢,,)/Q), we write 
t for the complex conjugation o_;, and we let Gt = G/(1). 

In this chapter, we write E for the subgroup E, of p-units of Q(¢,)*. Similarly, 
we write C for the the group C, of cyclotomic p-units. The group C, is the multi- 
plicative Z[G]-module generated by | — ¢,,. See Exercise 14.4. 

The obstruction group 


H = {a € Q(é,)* : ord:(w) = 0 (mod q) for all primes t 4 p}/Q(é,)**. 


was introduced in Chapter 7. It is an F,[G]-module. We consider the Selmer group 
S of Chapter 10. It is the F,[G]-submodule of H defined by 


S = {a € H: a@isaq-adic qth power}. 


Here we call a € Q(¢,)* a “q-adic g-th power”, if it is a gth power in the completion 
F, of Q(¢,) at each of the primes q lying over q. 

By Corollary 10.3, the number x — ¢, is contained in S. Therefore, (x — ¢ oe 
is contained in the plus part S* of S. In this chapter, we show that St is annihi- 
lated by some nonzero element of F,[G*]. This then contradicts the fact, proved in 
chapter 12, that (x — ¢,)'*' generates a free F,[G*]-module inside H*. We prove 
our annihilation result under the assumption that p # | (mod q). In this case, g does 
not divide #G* and the group ring F,[G*] is semisimple. Therefore, the theory of 
chapter 13 applies and the results there are used in several places in the proof. 


Theorem 14.1 Suppose that p > q are odd primes and that p ¥ | (mod q). Then 
the F,[G*]-module S* has a nonzero annihilator. 


Proof By E“ we denote the subgroup {e € E : ¢ is a q-adic gth power} of the 
group E of p-units. It contains the subgroup E’% of gth powers. The exact sequence 
of Proposition 7.4 gives rise to the exact sequence 
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O— E®s/ET —» st —>+ ClItIql. 


By C we denote the subgroup of the group C of cyclotomic p-units that consists 
of the cyclotomic p-units that are qg-adic gth powers. It is a Z[G]-module. By 
Lemma 7.1, the group E/E% is an F,[G* ]-module. It admits a filtration 


Oc OWE. C  CRUET ic BJ ES 
—S -— —S—” SS —’” 


E\ En E3 


with successive subquotients E,} = CE%/E%, Ey = CE4/CE4, and E3 = 
E/CE‘, respectively. The natural map E; <> E/E is injective. Its cokernel 
is EM /CE?, Since EY 1 CE? = CE, the natural map EY /C EY —> 
E/CE‘ is injective. Since E/CE’ = E3, we obtain the following diagram with 
exact rows and columns: 


Ey 


aa 


0 — E®/EI — $+ — CI*[q]. 


vv 


EB 


Since q does not divide #G*, Proposition 13.1 applies and every exact sequence of 
F, [G*]-modules splits. Therefore, we have an injection of F,[G* ]-modules: 


st ~~ E, x E3 x CI*[q]. 


Theorem 16.3 below is a special case of Thaine’s theorem [48]. It says that the 
F,[G*]-annihilator of E; = E/CE® also annihilates Cl*/CI*. Since q does 
not divide #G*, Proposition 13.2 implies that the F,[G*]-modules Cl+/CI*‘4 
and C/*[q] are isomorphic. Therefore, the F,[G*]-annihilator of E; = E/CE4 
annihilates C/*[q]. It follows that the F,[G* ]-annihilator of E3 x EF, annihilates 
E, x E3 x Cl*[q] and hence S*. Therefore, we are done if we show that the 
F,[G* ]-annihilator of E3 x Ej is not zero. 

Suppose it is. By Proposition 13.1, all exact sequences of F,[G*]-modules split. 
Therefore, E; x E> x E3 is isomorphic to E/E‘. By Corollary 13.7, the F,[G*]- 
module E/E? is free of rank 1. The fact that the F,[G*]-annihilator of E3 x Ey; 
is zero implies by Exercise 14.1 that the annihilator of FE, is the unit ideal. This 
implies E, = 0. Since E; = CE’/C E4, this means that every cyclotomic p-unit 
is a g-adic gth power. Since p > q, this is impossible by Proposition 14.2 below. 

This proves the theorem. 
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Let p, g be odd primes. When p ¥ 3, it is highly unlikely that all cyclotomic 
p-units in Q(¢,) are g-adic q-th powers. At present, however, this can only by shown 
when p > q. 


Proposition 14.2 Let p, g be odd primes. If every cyclotomic p-unit contained in 
Q(é,) is a q-adic qth power, then one has p < q. 


Proof Let ¢ be a primitive pth root of unity in Q(¢,). If every cyclotomic p-unit 
is a q-adic gth power, then every cyclotomic p-unit is congruent to a gth power 
modulo q?. We have in particular 


c¢4 —1=u! (mod q”) for some u € Q(ep)*. 
From the fact that (¢ — 1)? = ¢% — 1 (mod q), we deduce that 
(¢ —1)4 = u? (mod q). 
Both sides of the equations are gth powers. By Exercise 10.2, the ring Z[¢,]/(q) 
contains no non-zero nilpotent elements, so that Exercise 10.1 implies that the 
congruence also holds modulo q?: 
(¢ — 1) = ¢4 — 1 (mod q’). 
This means that the pth root of unity ¢ is a zero of the “Witt polynomial”: 


(T—1)'-T¢ +1 
W(T) = a FIT]. 


This implies that in the ring F,[T], the polynomial W(T) is divisible by the pth 
cyclotomic polynomial. Inspection of the degrees of these polynomials shows that 
necessarily p — 1 < q — 2, as required. 

This proves the proposition. 


Now we prove Theorem II of Chapter 1. 


Theorem II Let p, gq be odd primes and suppose that x, y are nonzero integers 
satisfying Catalan’s equation x? — y4 = 1. Then we have either q = | (mod p) or 
p = 1 (mod @q). 


Proof By Lemma 6.1, we have p 4 q. By symmetry, we may assume that p > q. 
Then, clearly, gq 4 1 (mod p) holds. Suppose that also p ¥ 1 (mod q). 

Theorem IV implies that we may assume p,q > 7 and that therefore we may 
apply Theorem 12.4. It follows that the F,[G*]-submodule of H* generated by 
(x — ¢,)'* is free. On the other hand, by Corollary 10.3, the element (x — ¢,)'*! 
is contained in the group St and Theorem 14.1 says that St admits a nontrivial 
F, [G*]-annihilator. This contradiction implies Theorem II. 
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The proof of Theorem II uses Theorem IV only when one of p, g is smaller than 7. 
Therefore, one needs only to check the conditions of Theorem IV for the primes 3 
and 5 only. In other words, it suffices to know that the minus class numbers of the 
cyclotomic fields Q(¢,) and Q(¢,) are both equal to 1. 


Exercises 


14.1 


14.2 


14.3 


14.4 


Let R be a ring and let A, B be two R -modules with the property that A x B 
is a free R -module of rank 1. Show that the R -annihilators of A and B are 
coprime ideals. In other words, show that Annr(A) + Annr(B) = R. 

Let g be a prime. Show that the polynomial W(T) of Proposition 14.2 is 
congruent to 


Show that 7? — T + 1 divides the polynomial W(T) = (T—1)4-—T47+1)/qT 
for every prime g > 5. 

Show that the group of cyclotomic p-units C C Q(¢,)* contains 2). Show 
that C/{12p is a free Z[G*]-module of rank | generated by 1 — ¢,,. Here Gt 
denotes the Galois group of Q(¢ ae over Q. 


15 
The Density Theorem 


In this chapter, we prove N.G. Chebotarév’s density theorem (Fig. 15.1). This impor- 
tant result regards densities of certain sets of prime ideals of a given number field. 
Dirichlet’s theorem on primes in arithmetic progressions is a special case. Our proof 
follows Chebotarév’s original strategy. This method does not involve any class field 
theory. The results of this section are classical and can be found in several textbooks, 
e.g. [24]. See also [28]. 

Let F be a number field and let K be a finite Galois extension of F. Let 
G = Gal(K/F). For any prime ideal q of K, we define its decomposition group 
by Gg = {o € G: o(q) = q}. Let p = pM Of denote the prime ideal of F over 
which q lies. The natural homomorphism from G,, to the Galois group of Ox /q over 
O;/p is a well-defined surjective homomorphism. If p is unramified in K C L, this 
homomorphism is also injective. The Galois group of Ox/q over Of/p is cyclic. 
It is generated by the Frobenius automorphism of the finite field Ox /q. This is the 
automorphism given by 


xr xXN® — for x € Ox/q. 


Here N(p) = #(Of/p). The fixed field of the Frobenius automorphism is precisely 
the subfield Of /p. 


Definition Let p be a prime ideal of F that is unramified in the extension F Cc K. 
Let q be a prime of K that lies over p. The Frobenius element gq of q is the unique 
automorphism in G, that induces the Frobenius automorphism of the finite residue 
field Ox /q. 


Since the primes q lying over p are Galois conjugates of one another, the corre- 
sponding Frobenius elements gq are conjugate in the Galois group G. In fact, the 
Frobenius elements constitute a full conjugacy class of the group G. This conjugacy 
class depends only on the prime ideal p, and we call it the Frobenius conjugacy class 
associated to p. The Frobenius conjugacy class of p is the identity if and only if p is 
totally split in K. 
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Fig. 15.1 N. G. Chebotarév (1894-1947). (Reproduced by with the kind permission of the Museum 
of the History of Kazan University, a division of Kazan State University.) 


In the special case where G is abelian, the conjugacy class consists of one single 
element. It is the Frobenius element of any of the primes q that lie over p. In this 
case, we may also call ~, the Frobenius element of p. 

We look in more detail at two special situations. 


Example 15.1 (Cyclotomic extensions) The homomorphism 


(Z/mZ)* —> Gal(QEn)/Q) 


given by x +> oy is an isomorphism. Here o, is the automorphism given by the 
formula o,(¢,,) = ¢;,. The primes p that do not divide m are unramified in the 
extension Q C Q(¢,,). The Frobenius element g, of any prime q lying over an 
unramified prime p is the element o, of the group Gal(Q(¢,,)/Q). It corresponds to 
the congruence class of p in (Z/mZ)*. Since Gal(Q(é,,)/Q) is abelian, the Frobenius 
element depends only on p and not on the prime q lying over p. 

More generally, let F be a number field and m be an integer. Then all prime 
ideals p that do not divide m are unramified in the extension F C F(é,,). The 
restriction map is an injective homomorphism from the Galois group Gal(F'(¢,,,)/F) 
into Gal(Q(é,,)/Q). It is an isomorphism if and only if FN Q(é,,,) = Q. The Frobe- 
nius element of an unramified prime q of F(¢,,,) lying over the prime p of F is the 
element On(p): In terms of the isomorphism Gal(Q(¢,,)/Q) = (Z/mZ)*, the Frobe- 
nius element is the congruence class of N(p) in (Z/mZ)*. As in the case F = Q, 
the Frobenius element depends only on the prime p and not on the primes q lying 
over it. 


Example 15.2. (Kummer extensions) Let m be a natural number and let F be a 
number field containing the mth roots of unity. For any finitely generated subgroup 
A of F*, we write F(4/A) for the field extension of F that is generated by the mth 
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roots of the elements of A. The field F (VA) is a finite extension of F. The natural 
Kummer homomorphism 


Gal(F(/A)/F) —> Hom(A/AN (F*)", tm) 


given byo +> f,, where f(a) = 0(2/a)/ X/a for a € A, is a well-defined isomor- 
phism of groups. See Exercise 9.9. 

The prime ideals p of F that do not divide m or any of the numerators and 
denominators of the elements in A, are unramified in the extension F C F (WA). By 
Exercise 15.1, the number N(p) — | is divisible by m. Since Gal(F(4/A)/F) is 
commutative, the Frobenius elements gq of the primes q lying over p are equal to 
one another. They are equal to the homomorphism A/(A NM F*”) —> [4 that maps 
a € A to the unique root of unity € € «,, for which 


N@)=1 


a” (mod p). 


§ 


Indeed, by definition of the Kummer homomorphism, we have 


Newt Gq 4/a) 
=a 


a 


(mod p) 


for all a € A. This shows that 


gq(s/a) = s/ak = fa-a "= Ya’" (mod q) 


for any prime q lying over p, as required. 


The zeta function ¢;(s) of a number field F admits an Euler product. We have 


-1 
tr(s)= | | (: = ar) for s € C with Re(s) > 1, 
p 


where the product runs over the prime ideals of F. The zeta function ¢f(s) admits 
a meromorphic extension to C with a simple pole at s = 1. See [24]. Therefore, 
Exercise 15.3 implies that for s ¢ R.,, we have 


1 
-~W —— 
Die ( o 


1 
y —— + (5), 
— Nipy ae 


—log(s — 1) 


where g(s) is a function that remains bounded as s approaches 1. Writing h,(s) ~ 
h>(s) when two functions h,, hz : R.; —> C have the property that h\(s) — ho(s) 
remains bounded as s — 1, we have 
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In particular, we have 


lim 
sli <4 N(py 


= +00 


Definition Let F be a number field and S be a set of prime ideals of F. The lower 
density d;,(S) and upper density d}(S) of S are defined by 


ese 
d;,(S) = liminf “8” 
si Tog() 
and 
1 
Dipes N(p)s 


dt (S) = lim sup ; 
. sii log(4) 


respectively. We have 
0 < dp(S) < dp(S) <1. 


If the lower density d;, (S) is equal to the upper density d/'(S), we define the density 
dr(S) by 


dy(S) = d;(S) = d3(S). 


As we explained, the denominator in the definitions of d}(S) and d;,(S) is, up to 
a bounded function, equal to a ao where the sum runs over all primes p of F. 
Therefore, the quantity dr(S) is called a density. It measures, in a certain sense, the 
proportion of primes of F that is in S. Finite sets S have density zero. Exercise 15.6 
implies that for any set S, the set that consists of the prime ideals in S of degree 1 
has the same upper and lower density as S. 

Although we do not need it here, we mention that the natural density d¥""(S) of 


a set of primes S is defined as 


Hp eS: p<X) 


an(S) = jim app SX) 
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The notion of natural density is perhaps closer to our intuitive notion of density. The 
following proposition explains the relation to our notion of density. It is not needed 
for the proof of Chebotarév’s density theorem. 


Proposition 15.3 Let F be a number field and let S be a set of prime ideals of F. 
If the natural density dj“"(S) exists, then so does the density dr(S). In addition, they 
are equal. 


Proof For any X € R, let 7(X) denote the number of prime ideals of F of norm 
< X and let 25(X) denote the number of prime ideals in S of norm < X. We have 


ad, (8) = lim 


We write d = d?"(S) and let e > 0. Then there exists Xo so that for all X > Xo, we 
have 


|s(x) — dm(X)| < em(X). 


Let s € R,s1. We have Xs my = Visi GE — gop) and, similarly, 
pes Wey = Lune TSM Gr — Gapp)): It follows that 


1 1 
= Yrs) —dm(n)) (= < (n+ =) : 


n>1 


1 1 
Noy a, (py: 


pes 


We split the sum over n into two parts: a sum over < Xo and one overn > Xo. 
This leads to 


1 1 
Do wey a Ney] = O%O +L py ar 


peS 


Finally, we divide by ar WF and let S tend to 1. We find 


Since this inequality holds for every ¢ > 0, the result follows. 


Our proof of Chebotarév’s density theorem starts off by proving a special case. 
It is the generalization of Dirichlet’s theorem on primes in arithmetic progressions 
to an arbitrary number field F. 


Proposition 15.4 Let F be a number field and m be a natural number. Let H be the 
Galois group of F(¢,,) over F. Then for every h € H, the density of the set of prime 
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ideals p of F for which the Frobenius element is h, is equal to 1/#H. In particular, 
for every h € H, the set of such prime ideals is infinite. 


Proof The restriction map identifies the Galois group of F(¢,,) over F with a 
subgroup H of (Z/mZ)* = Gal(Q(ém)/Q). For every prime ideal p of F,, its Frobe- 
nius element is N(p) € H Cc (Z/mZ)*. Here the norm is taken from F to Q. 

For every character x : H —> C%, we define the Dirichlet L-series L(s, x) by 


x(NU)) 
L(s, x) = : 
xX aly 


x(N(p))\! 
= —_ 
A eer 


p 


for s € C with Re(s) > 1. Here the sum runs over all nonzero Of-ideals J and the 
product runs over the primes p of F. By convention, we put x(k) = O whenever k 
is an integer that is not prime to the conductor of x. See Exercise 15.7. The fact 
that the L-series L(s, x) admits Euler product expansions follows from the fact that 
every fractional F-ideal is in a unique way a product of prime ideals. We have 


Cres) = | [ Ls, x), 
x 


where the product runs over the characters x of H. The L-series corresponding to 
the trivial character is precisely the zeta function of the field F. By Exercise 15.8, 
the L-series corresponding to the nontrivial characters converge for s € R.,. Since 
both zeta functions ¢;(s) and ¢p¢,)(s) have simple poles of order lat s = 1, we 
have 


Ld, x) 40 for each nontrivial character x. 


Taking logarithms, we see that for each nontrivial character x, the sum 


5 x) 
— Nw 


remains bounded as s —> 1. 
Finally, for each h € H, we consider the functions f),(s) on R,.; given by 


N 
fils) = > x ay>> a (s € C, Re(s) > 1). 
x p 


Here the sum runs over the characters of H. Up to a function that remains bounded 
as s — 1, the function f),(s) is equal to the L-series corresponding to x = 1. Since 
the latter function is the zeta function of F, we have 
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1 
Sn(s) ~ log (—) : 


Changing the order of summation, we find 


> 


(N(p)h7') 
Sis) ~ > Lx XNA) 


Ny 


which, by the orthogonality relations for characters, becomes 
1 #H 
log | —— ] ~ : 
s—l N(py 


p 
N(p)=h (mod m) 


It follows that the density of the set of prime ideals p satisfying N(p) = h (mod m) 
is 1/#H, as required. 


The key point is to prove Chebotarév’s density theorem for cyclic extensions. 
The next proposition takes care of this case. 


Proposition 15.5 Let F be anumber field and F C K be acyclic Galois extension. 
Put G = Gal(K/F). Then for every o € G, the set S of prime ideals p of F with 
Frobenius element equal to o has a density. It is given by 


1 
dr(S)= ¥G. 


Proof Leto € G and q be any prime number that is not ramified in the extension 
Qc K. Let ¢ denote a primitive gth root of unity. Since K N Q(¢) = Q, the 
Galois group H of F(¢) over F is isomorphic to (Z/qZ)*, and the restriction 
map from Gal(K(¢)/F) to Gal(K /F) x Gal(F(¢)/F) identifies Gal(K (¢)/F) with 
Gx H. 


K(é) 

K L F(¢) 

I~ Je 
F 


Let S be the set of prime ideals of F whose Frobenius element is equal to o in G. 
Then we have 
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S= U §,, 


teH 


where S, denotes the set of prime ideals of F whose Frobenius element is equal to 
(o, t) € G x H. Since at this point in the proof we do not know that the sets S and 
S, actually have well-defined densities, we work with lower densities. The latter 
always exist. By Exercise 15.4, we have 


dz (S) = Y° dz (Sr). 


teH 


Now let n = #G and suppose that the prime g satisfies g = 1 (mod n). Fix t € 
H = (Z/qZ)* of order divisible by n. Then the subgroups ((o, t)) and G x {1} 
of G x H have trivial intersection. Therefore, the composite of the corresponding 
fields of invariants is equal to K (¢). Denoting the invariant field of ((o, t)) by L, this 
means precisely that L(¢) = K(¢), so that the extension L C K(¢) is cyclotomic. 
By Proposition 15.3, the density of the set of primes T of L with Frobenius element 
in Gal(K(¢)/L) C G x H equal to (0, T) exists; it is given by 


1 


d,(T) = ———-—_-. 
Y= Oso 


The subset of T consisting of the primes of degree 1 over L has the same density. 
Therefore, we have d,(T) = d, (T) = [L : F]d,(S;,). It follows that the density 
of the set S; of primes p of F with Frobenius element in G x H equal to (0, T) is 
given by 


1 1 
[L: FJ[K(£):L] #G-#H° 


dp (St) = 


Summing over all automorphisms t € H of order divisible by n, one obtains an 
estimate for the lower density of the set S. It is given by 


dz(S) > #H, 

pO #G#H" 
where H,, is the set of t € H of order divisible by n. See Exercise 15.4 Now 
we apply Proposition 15.3 to the field Q. This is Dirichlet’s theorem on primes in 
arithmetic progressions. It ensures that for every k > 0 there exists a prime number 


q = 1 (mod n*) that is unramified in K. By Exercise 15.5, the number of elements 
in H = (Z/qZ)* of order divisible by n* is 


1 
(q- vy] (1 ~ eres | ‘ 


p\n 
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This implies that for every k > 0, we have 


#H, 1 
= Ps 
#H ( Fa) 


p\n 


Now the main result of this chapter follows. By letting & tend to infinity, we see that 
d,(S) = : 
ee #G° 


Applying this to all other elements of the group G, one finds that the upper density 
as ) is at most 1/#G. Therefore, the lower and the upper densities coincide, and 
we have 


1 
dr(S) = ¥G" 


This completes the proof of the theorem. 


Theorem 15.6 (Chebotarév’s density theorem) Let F be anumber field and F C K 
be a finite Galois extension. Set G = Gal(K/F). Then for every conjugacy class 
C of G, the set S of prime ideals p of F whose Frobenius conjugacy class is C has 
density given by 


#C 
dp(S) = ge. 


Proof Leto € C, and let E C K be the fixed field of o. Then K is a cyclic Galois 
extension of E with Galois group (a). By Proposition 15.5, the density theorem 
holds for the extension E C K. This means that 


1 
Way? 1 

ary = tim Het Mar af 
sl log(=j) f 


Here F is the order of o € G and T denotes the set of prime ideals q of E whose 
Frobenius element in Gal(K /E) is equal to o. We replace T by its subset of primes 
of degree 1. This does not affect the density dz(T). 


Let p be a prime in S. There are #G/f primes q of E lying over p. Their Frobe- 
nius elements g, in Gal(K /F’) are all conjugate and are therefore contained in C. 
Therefore, for every o’ € C, the number of prime ideals q of K that lie over p 
and whose Frobenius element is o’ is the same. It follows that this number is equal 
to #G/(f#C). This holds in particular for o itself. Since for every prime in E over p 
there is precisely one prime q of K lying over it, we see that there are also #G/(f#C) 
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prime ideals of E lying over p with Frobenius element equal to 0. From this and the 
fact that p splits completely in E, we deduce that 


1 f#c 1 
2 Ny 7 #6 2 N@ 


pes 


Letting s tend to 1, we find 


iss f#C 1 _ #C 


dry =e a= FG’ 


as required. 


Exercises 


15.1 


15.2 


15.3 


Let m be a natural number and let F be a number field containing the mth 
roots of unity. Let p be a prime ideal of F that does not divide m. 


(a) Show that the reduction map “4, —> (Of/p)* is an injective group 
homomorphism. 
(b) Deduce that m divides N(p) — 1. 


Let K = Q(¢3, V2). The Galois group G of K over Q is generated by the 
automorphisms t and o of order 2 and 3, reSpecuvely. It is isomorphic to the 
symmetric group 53. Here t is given by t(¢3) = ¢3 "and t(\/2) = V2, while 
o is given by o(¢3) = 3 and o(\/2) = 63/2. The ring of integers of K is 
given by Ox = Z[6&3, V2]. 


(a) Show that the primes of K that lie over 7 are the ideals q = (¢3 — 2) and 
a a — 2). Show that G, = Gj: is equal to (oc) and that the Frobe- 
nius element ~ is equal to o” while Gq = o. The Frobenius conjugacy 
class of the prime 7 is therefore equal to the conjugacy class {o, 0~'} 
of G. 
Show that the primes of K that lie over 5 are the ideals q = (/2 — 3), 
q’ = (€3</2 — 3), and q” = (¢;'s/2 — 3). Show that the groups G;, Gv, 
and G,” are equal to the cyclic groups of order 2 generated by t, to, and 
to!, respectively. Determine the Frobenius elements of q’, q’, and q”. 
Show that the Frobenius conjugacy class of the prime 5 is the conjugacy 
class {t, ta, ta ~!} of G. 


Let F = Q(V2, V3). The Galois group G = Gal(F/Q) is generated by o 
and t, where a(V2) = J/2 and o(/3) = —V3, while t(/2) = —/2 and 
t(/3) = V3. The group G is isomorphic to the Klein 4-group. The ring of 
integers of F is given by Or = Z[V2, V3]. 


(b 


wm 
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(a) Show that the primes of F that lie over 5 are the ideals q = (/6 — 1) and 

= (/6 + 1). Show that G, = Gy = (ot). Determine the Frobenius 
elements of q’ and q’. 

(b) Show that the primes of F that lie over 7 are the ideals t = (/2 — 3) and 

= (/2 + 3). Show that Gg = Gq = (a). Determine the Frobenius 
elements of q’ and q’. 


15.4 Let ay, b, be two sequences of real numbers. Show that 
liminf(a, +b,) >  liminfa, + liminfb,, 
n->oo noo noo 


n—->O©o noo noo 


limsup(a, +b,) <  limsupa, + limsupb,. 


15.5 Let H be acyclic group of order m and let d be a positive divisor of m. Show 
that the number of elements in H of order divisible by d is equal to 


mT] (1- saat) 


p\d 


where here a, = ord,(m) and b, = ord,(d). 
15.6 Let F be a number field. 


(a) Show that the series 


1 
log(1 — — 
2 ( a0 - a+ TO) 


converges absolutely. Here the summation runs over the prime ideals p 
of F. 
(b) Show that the series 


1 
a N(p) 


deg(p)>=2 


converges absolutely. 


15.7 Let m be a natural number with m # 2 (mod 4), let H be a subgroup of 
(Z/mZ)*, and let HY denote its group of characters xy : H —> C*. 


(a) Leth € H. Show that eH’ x(h) = #H when h = 1, while it is 0 
otherwise. Let y € HY. Show that )°,,.,, x(h) = #H when x = 1, 
while it is 0 otherwise. 

(b) The conductor of a character x € H” is the smallest positive divisor f 
of m for which {x € H : x = 1 (mod f)} is contained in the kernel 
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of x. Determine the number of characters of conductor m for 
H = (Z/mZ)*. 
15.8 (Dirichlet’s criterion) Let a, € C be a sequence with the property that 


yA dy is bounded as N — oo and let b, € R be a sequence that tends 
monotonically to 0 as n — oo. Show that }°>°, dnbn converges. 


15.9 Let 1 < a < 9 bea natural number. 


(a) Show that the set S, of prime numbers whose first (i.e., most significant) 
decimal digit is equal to a does not have a natural density. 
(b) Show that the Dirichlet density of S, is given by 


log(a + 1) — log(a) 


da(Sa) = log(10) 


(Hint: Use the prime number theorem) 


15.10 Let Gbea finite group and leto € G have order f: Let C denote the conjugacy 
class of o. Show that #C - f divides #G. 


15.11 Let K = Q(é3, /2). Determine the Frobenius conjugacy classes of the Galois 
group of K over Q of the primes p satisfying 5 < p < 50. 


16 
Thaine’s Theorem 


In this chapter we prove an important special case of Thaine’s theorem [48]. 

Let p be an odd prime. Let ¢, denote a primitive pth root of unity and let 
oy =6,+6, ' We adopt the convention of chapter 14. This means that we write 
Cl and CI* for the ideal class groups Cl, of Q(¢,) and C Le of Q(é a ), respectively. 
We write for E for the group E,, of p-units in Q(¢,) and write C for its subgroup C, 
of cyclotomic p-units. 

Let G = Gal(Q(¢,)/Q) and 1 € G denote the automorphism that induces 
complex conjugation. We write F = Q(é A ) and let Gr = G/(.). We have 
Gt = Gal(F/Q). By E* and Ct, we denote the groups of -invariant p-units and 
.-invariant cyclotomic p-units, respectively. By Exercise 16.1, the natural homomor- 
phism 


Et/Ct —> E/C 


is an isomorphism of Gt-modules. 
We have the following “plus” version of Proposition 7.5. As usual, hy denotes 
the order of the ideal class group Cl* of Q(¢>). 


Proposition 16.1 Let p be an odd prime. Then the group E/C is finite of order se 


Proof This result goes back to Kummer. Since we do not need it, we merely sketch 
the proof. See [50, Ch. 4] for more details. The key point is the fact that the zeta 
function ¢f(s) is a direct product of the L-series L(s, x), where x runs over the 
characters of G*. For the nontrivial characters x of G = Gal(F/Q), we have 


p-1 


1 4 nia 
EO = Zeon 2X (a)log|1—e > |, 


while for x = 1, the L-series L(s, x) is equal to the Riemann zeta function and has 
a simple pole of order | at s = 1. Confronting this with the formula for the residue 
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of ¢-(s) ins = 1 and using the fact that the Gaussian sum t(x~') has absolute value 
a/P, we find 


i= 3 2nia 
oR = ees: \(a)log|l1—e? |. 


x41 a=! 


Here R denotes the regulator of F. Like the regulator, the right-hand side of this 
formula is also equal to a determinant of logarithms of absolute values of units. In 
this case, cyclotomic units are involved. The quotient of the two determinants is 
equal to the index [Et : Ct] =[E: C]. 

This proves the proposition. 


It is known that the finite groups C/* and E/C are, in general, not isomorphic as 
Z[G*]-modules or even as groups. Indeed, for p = 62501, the 3-parts of C/* and 
E/C are non-isomorphic groups [50, p. 146]. However, it follows from the results 
of Mazur and Wiles [30] and of Kolyvagin [22] that for any prime q not dividing 
p-—1, the q-parts of C/+ and E/C are Jordan—Hélder isomorphic Z,[G* |-modules 
[50, Thm. 15.7]. Using the terminology of chapter 13, this is equivalent to saying 
that for every character x of G*, the x-parts of the g-parts of C+ and E/C have 
the same order. Thaine’s theorem [48] complements this statement. It says that the 
Z[G*]-annihilator of the g-part of E/C also annihilates the g-part of Cl*. 


Fig. 16.1 Francisco Thaine. (Reproduced with the kind permission of Francisco Thaine). 
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We prove a somewhat weaker version of Thaine’s theorem. The following lemma 
is the key ingredient. 


Lemma 16.2 We use the notation introduced above. Let p and q be distinct odd 
primes with q not dividing p — 1. Let C be a class in Cl* /CI*‘. Then there exists a 
prime ideal lof F = Qc) of degree I with the following properties: 


e [lies over a prime number | that is congruent to 1 (mod pq); 
e the class of \in CI*/CI*4 is C; 
e the natural map 


Et/E** —+ (Of/lOr)*/(Or/lLOr)*4 


is an isomorphism of F,[G*]-modules. 
Before proving the lemma, we show how it implies Thaine’s theorem. 


Theorem 16.3. (F. Thaine, 1988) Let p and q be distinct odd primes with q not 
dividing p — 1. Let G* be the Galois group of F = Qé7) over Q. If 6 € Z[G*] 
annihilates E/C E4, then it also annihilates CI*+ /CI*‘. 


Proof Suppose that @ annihilates E/C EE? and let c € CI*. We have to show that 6 
annihilates C in the group C/+ /CI*“. Let | be a prime of F(¢,) in the class C, that 
has the three properties listed in Lemma 16.2. 

Since q does not divide p — 1, the ring F,[G*] is semisimple. By Corollary 13.7, 
the group E+ /E* is a free F,[G*]-module of rank 1. Let u € Et be a generator. 
By Exercise 16.1, the natural map E*/E*4 —> E/E‘ is an F,[G*]-isomorphism. 
Therefore, u also generates E/E4 as well as its quotient E/CE? = E+/CtE*‘. 
We have 


u’ =yv? forsome y € Ct andve E*. 


Claim There exists a unit ¢ € Q(¢p, ¢) for which 


N(e) = 1, 
€ =y (mod (¢ — 1)). 


Here the norm is taken from Q(¢,, ¢) to Q(ép). 
Indeed, when we have y = ¢; — | for some a € Z, just take ¢ = ¢) — @. It 
clearly has the property e = y (mod (¢ — 1))). Since / = 1 (mod p), we have 


aly 
Ne) == Fah 
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so that ¢ has the second property as well. Since any y € C is a product of the p-units 
¢), — 1, we can take for ¢ suitable products of the units (7 — ¢. 


Let t denote a generator of Gal(Q(¢p, ¢)/Q(Cp)). It fixes ¢, and raises ¢ to 
the sth power, where s is some primitive root modulo /. By Hilbert’s Theorem 90, 
there exists a € Q(¢,, ¢))* with the property that t(@)/a = e. See Exercise 16.2. 
The ideal generated by @ is t-invariant. The extension Q(¢,) C Q(fp, %) is 
unramified outside the primes that lie over /. Moreover, the primes that lie over 
1 are totally ramified. Since the primes over / are conjugate under the action of 
G = Gal(Q(ép, 6:)/Q(é)), we have the following equality of Q(¢,, ¢)-ideals: 


—-l)=[[om. 


oeG 


Here £ denotes a prime of Q(¢,, ¢7) over I. 
Exercise 16.4 implies 


(@@)= J] [ om”. (x) 


ocEG 


Here the exponents r, are in Z and J is a Q(¢,,)-ideal that is prime to /. The coeffi- 
cients r, are related to the cyclotomic unit y as follows. For each o € G7, consider 
the element a/(¢; — 1)’’. Its denominator does not involve the prime ideal o(). 
Since o(l) is totally ramified in F(¢,), the automorphism 1 acts trivially 
modulo o(£). Therefore, we have 


g ad Ea 
~ = d o(£)). 
(fi a4 ly T (= = =z) 3 = Ie (mo o( )) 


=s5 (mod o(£L)), we obtain 


ial 


Since -& 
o 1 


(04 Ea 
= Te d oa(L)). 
Gabe Gap eee 


Therefore, we have 
y =e =s" (mod o(L£)) 
for every o € G. Since both y and s”’ are contained in the field F, we find 
y =s'? (mod o(N) 


for allo € G. Since: fixes the prime ideal [, we have the same congruence for t(/). 
It follows that r, = 7,, for allo € G. This shows that r, depends only on the image 
ofo € Gt. 
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Since / splits completely in F, the ring Of /1O; is the product over o € Gt of 
the residue fields k,(,) of the primes o((). Therefore, the F, [G* ]-module 


(Or/1Or)*/(Or/LOry! = |] Ko /kew! 


oeGt 


is free of rank 1. Since the residue fields k,(1) are isomorphic to F/, the image of the 
cyclotomic unit y is equal to )>, r,o applied to a suitable F,[G*]-generator a. By 
the third property of the prime ideal I, the image of the element u also generates the 
rank | free F,[G*]-module (Or/lLOr)*/(Or/1Opr)*’. Therefore, we have i = u" 
for some unit 7 of the group ring F,[G*]. It follows that 


u? =y= yi drua Toe 


in (O-/lO;r)*/(O-/lO;-)*?. This implies that pe roo = @ in the group 
ring F,[G*]. 

Taking norms from Q(¢,, ¢) to F of the relation («), we find the following 
equality of F-ideals: 


(Na) = J! TT of” 


oeGt 


for some F-ideal J. It follows that the element )° G+ oo annihilates the class c of 
[in the group C/+/CI*‘. Therefore, 9 = 1 )-, roo also annihilates the class c in 
the group C/*+/CI**. This proves the theorem. 


The proof of Lemma 16.2 is an application of class field theory and Chebotarév’s 
density theorem. We recall the following basic fact from class field theory. We do 
not provide any proof of this result here. Readers interested in the proof are advised 
to study class field theory [10, 24]. 


Fact from class field theory Let K be a number field and K denote an algebraic 
closure. The Hilbert class field H(K) of K is the maximal extension of K inside K 
that is abelian and unramified at all primes of K. The Hilbert class field is a finite 
extension of K. It has the property that the map 


Clk —> Gal(H(K)/K), 


which sends the class of a prime ideal p to its Frobenius element @, in the group 
Gal(H(K)/K), is a well-defined isomorphism of groups. 
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H(K) 


L 


if the field K is itself a Galois extension of a number field L with Gal(K/L) = A, 
then the class group Clx is a Z[A]-module. From the maximality of the Hilbert 
class field, it follows that L C H(K) is a Galois extension so that the group 
Gal(H(K)/K) acquires the structure of a A through conjugation. In this situation, 
the homomorphism 


Cle —> Gal(H(K)/K) 
is a Z[A]-isomorphism. 


Examples Let K = Q(¢,,). Then H(K) = K(q@), where a satisfies o—at+1=0. 
The Hilbert class field of Q(¢29) is the degree 8 extension of Q(¢29) generated by the 
square roots of zeroes of the polynomial x? + 13x® + 31x> — 9x4 — 54x? — 3x? + 
23x — 1. 

We set up the notation that we use in the proof of Lemma 16.2. Suppose p and g 
are odd primes with g not dividing p — 1 and consider the diagram of number fields 
below. The restriction map 


Gal(F (fq )/Q(Sq)) — G* 


is an isomorphism of groups. Let A denote Gal(F(¢,)/F). Then the Galois group 
Gal(F (¢,)/Q) is isomorphic to Gt x A. Since the group ET C F(¢,)* is stable 
under the action of G* x A, the field F (fq; Et ) is a Galois extension of Q. 
By Exercise 16.3, the natural map from E*/E*‘ to the group F(¢,)* modulo qth 
powers is injective. Therefore, Kummer theory implies that the homomorphism 


Gal(F(¢,, VE*)/F(éq)) —> Hom(Et/E*, pq) 


given by o +> gq where g,(e) = o(e)/é, is an isomorphism. The group Gt x A 
acts on M = Gal(F(¢,, / E+)/F(é,)) by conjugation. On the other hand, it acts on 
Hom(Et /E*?, wg) via 


a(f)(a) = o(f(o~(a))) for alla ¢ Gt x Aanda eS. 
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These two actions are compatible. In other words, the “Kummer” isomorphism that 
we describe here is an isomorphism of Z[G* x A]-modules. See also Exercise 9.9. It 
follows that the group G* acts on Gal(F (Sq; LET )/ F (G,)) through its natural action 
on E*/E**, while A acts on Gal(F(é,, E*)/ F (&q)) through its action on group 
[Aq of qth roots of unity. We have 5(¢,) = 6°) for 5 € A. Here w : A —> (Z/qZ)* 
is called the Teichmiiller character. 


HG, E*) 


H(G,) FG, NE*) 
M 


Nees KG 
A 
F 


Q¢,) 


f 

\4% | 
Let H denote the subfield of the Hilbert class field H(F) that corresponds 
via Galois theory to the quotient group C/+/CI*? of Cl+ = Gal(H(F)/F). It 
is the maximal abelian extension of F of exponent g that is contained in H(F). 
The field H(¢,) is a Galois extension of Q and G* x A acts by conjugation on 
Gal(H(¢,)/F (,)) = Gal(H/F). This means that A = Gal(F(¢,)/F) acts trivially. 
Since A acts through the Teichmiiller character on M = Gal(F(,, E*) /F (Sq))s 
it follows that H(¢,) and F(é,, </E*) are linearly disjoint extensions of F(¢,)). 

Therefore, we have the following isomorphism of Z[A x Gt ]-modules: 


bt 


Gal(H(¢,, V E+)/F(6q)) = Gal(H/F) x Gal(F (fq E*)/F (Sq) 
= ci /ci**? x M. 
The quadratic extension Q(¢,) is linearly disjoint from H(¢,/ E+) over F. There- 


fore, the Galois group of H(¢p, ¢, Et) over F(¢,) is naturally isomorphic to the 
product of Gal(H(¢,/ E*)/F(¢,)) and Gal(Q(¢,)/F) = (1). 


Proof of Lemma 16.2 Since q does not divide p—1, the ring F,[G* ] is semisimple. 
By Corollary 13.7, the group E*/E*‘ is a free F,[G*]-module of rank 1. By 
Exercise 16.5, same is true for the F,[G*]-module M = Hom(E*/E*“, j1,). Let 
m be a generator of the F,[G*]-module M and consider the element 


(c,m, id) € Gal (H(¢,, /E*)/F(6,)) = Cl*/CI*4 x M x (i). 


By Chebotarév’s density theorem (Theorem 15.4), the set of primes £ of F(¢,) 
whose Frobenius element is equal to (c, m, id) has positive density. Therefore, there 
exists a prime ideal £ of F(¢,) of degree | with this property. We take for | the prime 
ideal of F over which £ lies. 
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We need to show that [ has the properties required by Lemma 16.2. Since £ is 
of degree 1 in F(f,) = Q(p, fq), the prime number / over which £ lies has the 
property that / = | (mod pq). The Frobenius element of the prime [ in Gal( H/F) 
is the restriction of the Frobenius element of £ in Gal(H(¢,)/F (¢,). Therefore, it is 
equal to c. 

To check the third property, consider the homomorphism 


E*/E*? —+ (On/lOp)*/(Or/lOr)"*. 


We claim it is injective. Indeed, suppose that ¢ € Et is in the kernel. Since / is 
totally split in F, the ring Of //O; is a product of the residue fields k, (1) and € is a 


qth power in Kw for every o € G*. Therefore, 


N(O-1 


€ ¢ =1 (mod o()) 


for allo € Gt. The Kummer maps fg) : E*/E*? —> pq are given by foy(e) = 
o(V/e)//e for all e € E*. Since we have 


H-1 


o(WJe)/e =e * (mod a(0) foralle € Et, 


we see that fo(y(€) = 1 (mod o(f)) and hence f(y (€) = 1 for alle € E*. Since f; 
is the Frobenius element associated to the prime ideal [ and o( f{) = fo) for every 
o €G*, the element f; generates Hom(E*/E*“, u,) over the group ring F,[G*]. 
It follows that f(¢) = 1 for every homomorphism f € Hom(E+/Et‘, 1,). This 
means that ¢ is a gth power and injectivity follows. 

Both groups E+ /E+? and (Or/1Or)*/(Or/1Or)*4 are Fy-vector spaces of 
dimension (p—1)/2. As a consequence, the injective homomorphism E+ / E+? —> 
(Or/lOr)*/(Or/LOp)* is an isomorphism. 

This proves the lemma. 


One can prove a theorem that is stronger than Theorem 16.2. We quote the 
following result, a proof of which can be found in [50, Prop. 15.2]. 


Theorem 16.4 Let F be a real abelian number field. Let G = Gal(F/Q), let E 
denote the group of units of the ring of integers of F, and let C denote the subgroup 
of cyclotomic integers as defined in [50, p. 15.2]. Let p be a prime not dividing the 
index [F : Q]. If @ € Z[G] annihilates the p-part of E/C, then 20 annihilates the 
p-part of the class group of F. 


Exercises 
16.1 Let p be an odd prime. We use the notation introduced in this chapter. 


(a) Show that the natural homomorphism E+/Ct —> E/C is an isomor- 
phism of Z[G*]-modules. 


16 Thaine’s Theorem 115 


16.2 


16.3 


16.4 


16.5 


16.6 


(b) For every odd prime g # p, show that the natural map E*+/E+4 —> 
E/E4 is an isomorphism of Z[G*]-modules. Similarly, the natural map 
C/C4 —> Ct/C*4 is a Z[G* ]-module isomorphism. 


Prove Hilbert’s Theorem 90: Suppose that K C L is a cyclic Galois extension 
with Gal(L/K) generated by t. Then for every x € L with N(x) = 1, there 
exists y € L such that x = t(y)/y. 


Let p,q be odd primes. Show that the natural map 


QE /QG HI — QEp)"/QWEp)"™4 
is injective. 


Let K Cc L be a Galois extension of number fields. Let G = Gal(L/K) and 
let 7 be a G-invariant L-ideal. Show that 


1=J]]»” 
p 


for some K-ideal J. Here ay € Z and the product runs over the prime ideals p 
that are ramified in the extension K C L. 


Let I’ be finite abelian group and let k be a field. Establish the “Goren- 
stein property” of the group ring A[I']: show that Hom,(k[I'], k) is a free 
k[I']-module of rank 1. Here the action of T on Hom;(k[V'], k) is defined by 
g(fy(x) = f(g-x)forg €Tandx € kl). 


Let p and q be odd primes and suppose that / is a prime congruent to | (mod q) 
that splits completely in F = Q(¢>). 


(a) Show that the prime ideals of F that divide / are conjugate to one another 
under the action of Gt = Gal(F'/Q). Show that their residue fields are all 
equal to the prime field F/. 

(b) Show that the F, [G*]-module 


[ [ater 


U1 


is free of rank 1. Here [runs over the prime divisors of / in F and k; denotes 
the residue field of [. 


Appendix. Euler’s Theorem 


Here we present Euler’s 1738 proof of the fact that the only nonzero rational solu- 
tions to the Diophantine equation X* = Y? + 1 are given by X = +3 and Y = 2. 
The proof is by a descent argument. 


Fig. A.1_ Leonhard Euler (1707-1783) 


Euler supposes that X, Y are nonzero rational numbers satisfying X* = Y* + 1, 
lets Y = Y’—1, and supposes that Y’ = c/b for coprime integers b, c € Z. Clearing 
denominators, this means that 
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be(c* — 3bc + 3b”) isa square > |. 


The nontrivial solution corresponds to c = 3, b = 1. We want to show that it is the 
only one with gcd(b, c) = 1. Since replacing c by 3c’ leads to the same condition 
with b,c replaced by c’, b, respectively, it suffices to show that the equation has no 
solution in nonzero coprime integers b,c with c 0 (mod 3)). Euler assumes there 
is a solution and performs a “descent”. This means that from the solution X,Y, he 
produces a new, smaller solution to the equation. Indeed, following his steps, we see 
that near the end of his proof he writes that tu(u* — 3ut + 317) is a square exceeding 
1. Ina precise sense, this solution is strictly smaller than the original solution. Since 
this process cannot continue forever, there cannot be any solution at all and the result 
follows. 


THEOREMA 10 


Nullus cubus, ne quidem numeris fractis exceptis, unitate auctus quadratum efficere potest 
praeter unicum casum, quo cubus est 8. 


DEMONSTRATIO 


Propositio ergo huc redit, ut S + 1 numquam esse possit quadratum praeter casum, quo 
% = 2. Quocirca demonstrandum erit hanc formulam a’b + b* nunquam fieri posse 
quadratum, nisi sit a = 2b. 

Haec autem expressio resolvitur in istos tres factores b(a + b)(aa — ab + bb), qui primo 
quadratum constituere possunt, si esse posset b(a +b) = aa —ab+bb, unde prodit a = 2b, 
qui erit casus, quem excepimus. Pono autem, ut ulterius pergam, a+ b= c seua =c—b, 


qua facta substitutione habebitur 


bc(cc — 3bc + 3bb), 


quam demonstrandum est quadratum esse non posse, nisi sit c = 3b; sunt autem b et c 
numeri inter se primi. Hic autem duo occurrunt casus considerandi, prout c vel multiplum 
est ternarii vel secus; illo enim casu factores c et cc — 3bc + 3bb communem divisorem 
habebunt 3, hoc vero omnes tres inter se erunt primi. 

Sit primo c non divisibile per 3; necesse erit, ut singuli illi tres factores sint quadrata, 


scilicet b et c et cc — 3bc + 3bb seorsim. Fiat ergo cc — 3bc + 3bb = (2b cy; erit 


n 


b 3nn — 2mn b 2mn — 3nn 
= vel = 


c 3nn—mm’ Cc omm—3nn’ 


cuius fractionis termini erunt primi inter se, nisi m sit multiplum ternarii. Sit ergo m per 
3 non divisibile; erit vel c = 3nn — mm vel c = mm — 3nn et vel b = 3nn — 2mn vel 
b = 2mn — 3nn. At cum 3nn — mm quadratum esse nequeat, ponatur c = mm — 3nn, quad 


quadratum fiat radicis m — Fn, hincque oritur = = “agree atque 
b 2m 3 3qq — 3pq + pp 
mon = Pq : 


Quadratum ergo esset haec formula pq(3qq—3pq+ pp), quae omnino similis est propositae 
bc(3bb — 3bc + cc) et ex multo minoribus numeris constat. At sit m multiplum ternarii, puta 


m = 3k; erit 2 = nnn , unde erit vel c = nn—3kk vel c = 3kk —nn; quia autem 3kk —nn 
quadratum esse nequit, ponatur c = nn — 3kk eiusque radix n — tk unde fiet 2 = “ager 
k 2pq 
seu = = atque 
n 3qq+ppP q 
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b 1 2k — pp +3qq —4pq 


nn n 3qq + PP 


Quadratum ergo essere esse deberet (pp + 3qq)(p — q)(p — 3q). Ponatur p — gq = ¢ et 
p—3q =u;eritd = S* et p 3!" iHaque formula abit in hanc tu(3tt — 3tu + uu), quae 
iterum similis est priori bce(3bb — 3bc + cc). 

Restat ergo posterior casus, quo est c multiplum ternarii, puta c = 3d, atque quadratum 
esse debet bd(bb — 3bd + 3dd); quae cum iterum similis sit priori, manifestum est utroque 
casu evenire non posse, ut formula proposita sit quadratum. Quamobrem praeter cubum 8 
alius ne in fractis quidem datur, qui cum unitate faciat quadratum. Q. E. D. 


Leonhardi Euleri: Theorematum quorundam arithmeticorum 
demonstrationes, in COMMENTATIONES ARITHMETICAE, 
Volumen I, Ed. Ferdinand Rudio, Lipsiae et Berolini, 
B.G. Teubner, MCMXV. 
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